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FOREWORD 
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1  INTRODUCTION 


In  recent  years,  there  has  been  much  interest  in  developing  practical  logical  systems  which 
can  be  used  to  produce  probabilistic  inferences  from  probabilistic  evidential  information.  The 
designs  for  such  systems  are  generally  hybrid  concepts  based  upon  the  well-known  theories 
of  probability  and  first-order  predicate  logic.  Some  of  these  inference  systems  also  include 
the  probabilistic  effects  of  influences  between  logical  variables  using  network  models.  These 
networks  are  usually  represented  by  acyclic  directed  graphs  in  which  each  node  corresponds 
to  a  random  logical  variable  that  is  associated  with  the  application  domain  of  interest  and 
each  arc  expresses  the  “influence”  of  one  variable  upon  another.  Various  schemes  -  which  are 
usually  based  upon  a  Bayesian  probability  calculus  -  are  used  to  evaluate  the  effects  of  influences 
and  to  obtain  resultant  probabilities  for  key  inferences  that  may  be  logically  constructed  from 
the  system’s  variables.  Since  realistic  inference  systems  tend  to  include  many  variables  and 
influences,  maintaining  a  self-consistent  system  model  CEin  be  difficult,  especially  in  dynamic 
and  collaborative  environments  where  frequent  changes  from  multiple  sources  are  made  to  the 
system. 

The  Dirac  algebra  of  quantum  mechanics  has  been  developed  by  physicists  for  the  purpose  of 
encoding  for  calculation  the  probabilistic  nature  and  uncertainty  characteristics  intrinsic  to  the 
quantum  physical  world.  The  fact  that  probabilistic  inference  systems  possess  characteristics 
analogous  to  these  suggests  that  the  Dirac  algebra  can  also  be  applied  to  inference  systems 
for  encoding  and  calculational  purposes.  This  report  demonstrates  that  the  Dirac  algebra  does 
indeed  provide  a  natural  and  efficient  setting  for  the  algebraic  representation  and  processing  of 
probabilistic  inference  systems.  Dirac  algebra  based  inference  systems  are  called  Dirac  networks 
and  show  that  such  systems:  (1)  provide  for  a  self  consistent  and  efficient  encoding  of  the 
probabilities  and  distributions  for  all  possible  combinations  of  truth  values  for  the  system’s 
logical  variables;  (2)  can  utilize  imitary  rotation,  time  evolution,  and  translation  operators  to 
modify  probabilities  and  distributions  for  variables  that  are  induced  by  the  influences  of  other 
variables  and  by  time;  (3)  permit  unambiguous  linear  and  controlled  cyclic  cause  and  effect 
relationships  through  the  use  of  operator  order  indices  and  an  ordering  operator-,  (4)  guarantee 
that  the  system  is  always  normalized;  (5)  enable  the  straightforward  evaluation  of  a  meaningful 
informational  uncertainty  measure  defined  in  terms  of  the  von  Neumann  entropy,  and  (6)  allow 
for  a  variety  of  “measurement”  operators  that  are  useful  for  quantifying  probabilistic  inferences 
that  may  be  derived  from  the  system. 

The  richness  and  self-consistent  nature  of  Dirac  networks  suggests  its  general  applicability 
to  such  diverse  areas  as  data  fusion  and  analysis,  dynamic  resource  allocation,  qualitative  anal¬ 
ysis  of  complex  systems,  automated  medical  diagnostics,  and  interactive/collaborative  decision 
processes.  As  is  well  known,  the  computational  complexity  that  is  associated  with  this  class 
of  inference  problem  is  formidable  [3],  [4].  The  implementation  of  such  applications  as  Dirac 
networks  using  conventional  software/hardware  engineering  techniques  does  not  alleviate  this 
computational  complexity  problem.  However,  the  fact  that  the  associated  Dirac  networks  have 
their  system  speciflcations  written  in  the  language  of  quantum  mechanics  suggests  that  quantum 
mechanical  implementations  which  render  the  computational  complexity  as  “polynomial  time” 
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might  eventually  be  possible  (recall  that  Shor’s  [11]  quantum  mechanical  algorithm  transforms 
the  computational  intractability  of  finding  the  prime  factors  of  an  integer  on  a  conventional 
computer  into  a  polynomial  time  computation  on  quantum  mechanical  computer). 

This  report  is  divided  into  four  major  sections.  The  first  section  describes  how  the  Dirac 
algebra  can  be  used  to  define,  modify,  and  “measure”  truth  state  vectors  for  probabilistic  in¬ 
ference  systems.  Application  of  the  algebraic  theory  to  include,  manipulate,  and  “measure” 
general  probability  distributions  for  inference  system  variables  is  discussed  in  the  second  sec¬ 
tion.  The  third  major  section  is  concerned  with  demonstrating  the  versatility  of  the  approach 
by  applying  it  to  several  representative  classes  of  problems.  The  fourth  section  initiates  a  brief 
discussion  concerning  how  certain  components  of  Dirac  networks  can  be  implemented  as  optical 
devices. 

It  should  be  noted  that  this  paper  is  intended  to  be  as  algebraically  self-contained  as  is 
feasible.  In  keeping  with  this,  many  simple  examples  are  used  to  illustrate  results  as  they  are 
developed.  In  addition,  proofs  to  theorems  are  included  to  further  aid  the  reader’s  imderstand- 
ing  of  the  Dirac  algebra.  However,  should  additional  background  material  be  required,  the 
reader  is  urged  to  consult  several  of  the  excellent  contemporary  textbooks  [1,  2]  which  develop 
modern  quantum  mechanics  from  a  Dirac  algebraic  perspective. 
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2  TRUTH  STATE  VECTORS  AND  OPERATORS  FOR  PROBABILISTIC 

INFERENCE  SYSTEMS 


2.1  THE  TRUTH  STATE  FOR  A  LOGICAL  VARIABLE 


A  two-dimensional  truth  vector  space  can  be  assigned  to  each  logical  variable  in  an  inference 
system.  The  truth  vector  space  assigned  to  the  variable  is  spanned  by  the  orthonormal  Dirac 
ket  basis  vectors  \  +)j  and  |  —)j  which  represent  the  true  (-(-)  and  false  (— )  values  that  can  be 
acquired  by  the  variable  (it  is  understood  that  ±  means  ±1),  respectively.  For  each  such 
space,  there  also  exists  a  dual  (or  adjoint)  space  spanned  by  the  Dirac  bra  basis  vectors  j{+  \ 
and  j{—  |.  Scalar  products  can  be  formed  from  these  bra-ket  vectors  according  to  the  following 
“multiplication”  rules: 

I 

and 

j(±  I  T)j  =0- 

These  rules  clearly  exhibit  the  orthonormality  property  of  the  truth  basis  vectors  and  can  be 
represented  more  compactly  by  the  following  single  expression: 

j  (u)  I  U))j  — 

where  u),uj'  €  {+,  — }  and  is  the  Kronecker  delta  defined  by: 


6 


(jj'uj  — 


1  when  u)'  —  u) 
0  when  lj'  ^  u: 


The  general  truth  state  for  the  variable  is  a  ket  vector  |  9j)  that  is  a  linear  superposition 
of  the  associated  truth  ket  basis  vectors  |  ±)j.  Specifically,  the  truth  state  for  the  variable 
is  defined  by 

I  9j)  =  cos9j  I  -l-)j  -l-sin^j  |  -)j, 

where,  in  order  to  avoid  logical  ambiguities,  it  is  required  that  0  <  <  f .  Here  the  coefficients 

cos  9j  and  sin  9j  are  the  probability  amplitudes  associated  with  the  true  and  false  values  for  the 
variable,  respectively.  The  squares  of  these  amplitudes  are  the  probabilities  of  the  truth 
values  for  the  variable.  Thus,  cos^  9j  is  the  probability  that  the  truth  value  for  the  variable  is 
“true”  and  sin^  9j  (=  1  —  cos^  9j)  is  the  probability  that  the  truth  value  is  “false.”  For  example, 
if  9j  =  0  [^),  then  the  associated  variable  is  precisely  true  (false).  More  generally,  suppose  that 
the  system  variable  is  the  assertion  “it  will  rain  tonight”  with  9j  —  Then  the  truth  state 
for  this  assertion  is  the  ket  ^  [-1-)^  -!-  ^  j— so  that  the  probabihty  that  the  assertion 

is  true  is  =  |  and  the  probability  that  it  is  false  is  ^5^  = 
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A  truth  state  \9j)  is  normalized  if  it  forms  a  unit  scalar  product  with  its  dual  (adjoint)  bra 
vector  defined  by 

{djl  =  cos djj  (-f-l  +  sin  Ojj  (-1 . 

Since  scalar  products  are  frequently  used  below,  it  is  suggested  that  the  details  of  the  proof 
of  the  following  lemma  may  be  of  pedagogical  value  to  the  reader  who  is  unfamiliar  with  the 
mechanics  of  the  algebraic  manipulation  of  bra  and  ket  vectors. 

Lemma  1  The  truth  state  for  any  variable  is  normalized. 


Proof: 

{Sj  I  Oj)  =  {cosdj  I  +sm6j  j(-  i)(cos0j  |  +)j  +  sin^j  |  -)j) 
=  cos^  6j  j(+  I  +  cos  9j  sin  9j  j{+  \  -)j+ 
sin  9j  cos  9j  j(—  |  -b)j  +  sin^  j{~  I  ~)j 
=  cos^  9j  +  sin^  9j  =  1. 


Q.E.D. 


This  result  is  readily  illustrated  using  the  truth  state  for  the  above  example: 


Bra  and  ket  vectors  admit  matrix  representations  which  are  especially  useful  for  compu¬ 
tational  purposes.  In  particular,  the  ket  basis  vectors  |-r)j  and  |— )j-  are  represented  by  the 
column  matrices 


respectively.  The  adjoint  of  a  matrix  (denoted  by  a  superscript  f)  is  the  complex  conjugate 
(denoted  by  a  superscript  *)  of  its  transpose.  Using  this  -  along  with  the  fact  that  the  matrix 
representation  for  a  bra  vector  is  the  adjoint  of  the  matrix  representation  for  the  associated  ket 
vector-  yields  the  following  row  matrix  representations  for  the  bra  vectors  j  (-l-|  and  j  (— |: 


and 


It  is  easy  to  see  from  this  and  the  definition  of  the  truth  state  that  \9j)  and  its  dual  {9j\  can 
be  represented  as  matrices  by 


(cos  9j 
sin  9j 


=  cos  9j 


1 

0 


+  sin  9j 
i 


0 

1 


j 
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and 

^  cos  $j  sin  6j  j  =  cos  Ojj  ^1  0  ^  +  sin  ^  0  1  j  , 

respectively.  We  shall  use  the  bra-ket  notation  to  stand  for  both  vectors  and  their  matrix 
representations. 

2.2  THE  TRUTH  OPERATOR  FOR  A  LOGICAL  VARIABLE  AND  ITS  MEASUREMENT 

A  truth  operator  fj  can  also  be  associated  with  the  logical  variable  of  an  inference  system. 
The  ket  basis  vectors  |  ±)j  are  eigenvectors  of  this  truth  operator  and  they  satisfy  the  eigenvalue 
equation  given  by 

I  ±)i  =  ±  i  ±)i- 

Thus,  the  eigenvalues  ±  (i.e.,  ±1)  can  be  interpreted  as  the  result  of  “truth  measurements” 
obtained  from  the  action  of  fj  upon  the  associated  basis  eigenkets.  This  notion  can  be  extended 
to  a  more  generalized  measure  of  the  truth  value  for  a  variable’s  truth  state.  This  quantity  is 
the  expected  truth  value  for  a  variable’s  truth  state  and  is  defined  for  the  variable  by  the 
product  {fj)  =  {6j  I  fj  1  Bj).  Obviously,  for  the  special  case  that  Bj  —  0  (^),  then 

{rj)  =  j  (+  i  Tj  I  +)j.  (j  (-  I  fj  1  -)j.) 

=  j  (+  I  +1  1  +)j  (j  (-  I  -1  I  -)j) 

=  +lj  (-r  I  4-)^  (-Ij  (-  I  -)^ 

=  41  (-1) 

and  the  expected  truth  value  is  precisely  true  (false).  The  expected  truth  value  of  a  general 
truth  state  for  a  variable  can  easily  be  calculated  using  the  following  lemma. 

Lemma  2  {fj)  =  cos^  Bj  —  sin^  Bj. 

Proof: 

{fj)  ^  {cosBjj{+  I  4sin%(-  \)fj{cosBj  \  +)j^siuBj  \  -)j) 

=  cos^  Bj  j{+  I  fj  I  -\-)j  -‘rCosBjsmBjj{+  \  fj  \  —)j  4 
sinBj  cosBjj  (— |fj  \+)j  +  sin^Bjj  {—\fj  |— )j 

=  cos^  ^jj{+  I  4)j  —  cosBj  smBjj{+  \  —)j  4 
sin  Bj  cosBj  j  {—  \  4-)^  -  sin^  Bjj  {-  \  -)j 
=  cos^  Bj  —  sin^  Bj. 


Q.E.D. 

The  expected  truth  value  for  a  variable  is  therefore  the  difference  between  the  probability 
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that  the  variable  is  true  and  the  probability  that  it  is  false  so  that 


-1  <  (fj)  <  +1- 


Thus,  when  (fj)  >  0  (  <  0),  the  variable  is  more  true  (false)  than  false  (true)  and  when  {fj)  =  0, 
it  is  as  precisely  as  true  as  it  is  false.  This  obviously  occurs  when  9j  =  |  and  represents  a  state 
of  complete  uncertainty  about  the  truth  for  the  associated  variable.  Referring  to  the  truth 
state  for  the  previous  example  assertion  “it  will  rain  tonight”  and  applying  the  last  lemma,  it 
is  readily  found  that  its  expected  truth  value  is  |  —  |  =  ^  •  Hence,  the  truth  state  for  this 
assertion  represents  the  fact  that  the  assertion  is  more  true  than  false. 


The  truth  operator  also  admits  a  computationally  useful  matrix  representation  defined  by: 


(  j  (  M  I  '  )j  j  (  ‘  I  I  )j  \ 

ji-lT-jHj  ) 


Observe  that  -as  required-  the  truth  eigenvalues  are  the  diagonal  entries  of  the  matrix.  Here, 
we  shall  adopt  the  general  convention  that  the  omission  of  the  symbol  from  any  operator 
signifies  the  use  of  its  matrix  representation.  Also,  if  such  an  omission  occurs  in  an  algebraic 
expression,  then  the  entire  expression  is  assumed  to  be  in  matrix  form.  A  simple  example  of 
this  which  involves  the  truth  operator  is 


{9j\  Tj  \9j)  =  (  cos6»^- 


An  operator  O  is  Hermitean  if  its  matrix  representation  is  self-adjoint,  i.e.  if  its  matrix 
representation  is  equal  to  the  complex  conjugate  transpose  of  itself.  In  this  case  we  write 
0  =  61 


Lemma  3  The  truth  operator  for  a  logical  variable  is  Hermitean. 

Proof: 

Thus,  fj  =  fJ.  Q.E.D. 

It  is  useful  at  this  time  to  introduce  the  identity  operator  Ij  for  the  logical  variable. 
This  operator  is  defined  by  its  action  !_,■  \9j)  =  \9j)  upon  a  truth  state  and  its  juxtapositional 
property  ijfj  =  fjlj  =  fj  with  the  truth  operator.  It  is  easy  to  see  that  the  identity  operator 
has  the  following  anticipated  matrix  representation: 
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It  is  also  clear  from  this  that  the  identity  operator  is  Hermitean.  We  note  that  the  symbol  1 
will  be  used  to  denote  a  general  identity  operator  so  that,  by  convention,  1  is  its  associated 
matrix  representation.  Although  “1”  also  denotes  the  unit  scalar,  the  meaning  of  “1”  will  be 
clear  from  the  context  in  which  it  is  used. 

The  truth  uncertainty  Atj  for  the  logical  variable  is  the  quantity 

Atj  = 

which  provides  a  measure  of  the  uncertainty  -  or  “fuzziness”  -  associated  with  the  expected 
truth  such  that  the  smaller  (greater)  the  Atj  value,  the  more  (less)  certain  the  (f  j)  value.  The 
truth  uncertainty  is  easily  calculated  using  the  result  of  the  following  theorem. 

Theorem  4  Arj  =  2sm6jCos0j. 

Proof:  First  observe  that  =  ij  (this  is  easily  determined  using  the  matrix  representation 
for  fj).  Then  (r'fj  =  1  so  that 

Atj  =  —  (cos^  6 j  —  sin^  Oj)'^ 

=  4  sin^  9j  cos^  9j 

—  2sm9jCOs9j. 


Q.E.D. 

Thus,  not  only  is  the  expected  truth  value  (fj)  for  variable  j  completely  determined  by  9j, 
but  so  is  Atj.  Note  that  -  as  required:  (i)  when  9j  — *•  0,  f ,  then  Atj  — ►  0  and  it  becomes  more 
certain  that  the  truth  value  is  (fj);  (ii)  when  9j  =  0,  then  At  j  —  0  and  {fj)  =  +1,  —1  (i.e., 
the  truth  value  is  -  with  total  certainty  -  precisely  an  eigenvalue),  respectively;  and  (iii)  when 
9j  =  J,  then  {fj)  =  0  and  the  truth  uncertainty  has  its  greatest  value  Arj  =  1. 


2.3  TRUTH  STATES  FOR  MULTI- VARIABLE  SYSTEMS 

The  truth  state  vector  j'®)  for  a  sj^tem  comprised  of  n  variables  is  the  tensor  product  of 
the  n  single  variable  truth  states.  This  tensor  product  state  is  written  in  Dirac  notation  as  the 
juxta-positions 

and  corresponds  to  a  vector  in  the  2"  dimensional  vector  space  spanned  by  the  orthonormal  set 
of  basis  ket  vectors  which  correspond  to  the  set  of  all  possible  truth  value  combinations  for  the  n 
variables  (one  tensor  product  basis  vector  represents  one  possible  combination).  Thus,  |5')  is  a 
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superposition  of  these  basis  vectors  in  which  each  vector  coefficient  encodes  an  amplitude  whose 
square  is  the  probability  for  the  logical  variables’  truth  values  represented  by  the  associated 
tensor  product  basis  state. 

To  make  this  more  precise,  let  AT  =  {1,2,  -  •  •  ,n}  index  the  variables  for  an  inference  system, 
W  =  {+,  — },  and  define 


=  (a;  =  u}iLJ2  •  •  ‘^71  I  ■ 


(2-1) 


This  is  the  set  of  all  possible  strings  of  -f-’s  and  — ’s  of  length  n.  Using  this,  the  truth  state 
vector  for  a  system  of  n  logical  variables  can  be  written  as  the  superposition 

\^)=  Y.  (2-2) 

(jj€Lp^ 


Here  |  w)  =  |cJiu;2  •  •  ■  Wn)  =|  |  0^2)  •••  |  i^n)  and  f  (ij)  =  h  (wi)  h  (^2)  •  •  •  h{un)  is  the 

probability  amplitude  for  |  u)  with 


h  {ojj) 


cos  dj  when  Uj  =  -I- 
sin  dj  when  uij  =  — 


(2-3) 


Recall  that  -  by  definition  -  scalar  products  can  only  be  formed  between  bra  and  ket  vectors 
for  the  same  logical  variable,  i.e,  between  vectors  in  the  same  space  with  the  same  subscript 
index.  With  this  in  mind,  the  following  lemma  shows  that  {|  a;)  |  w  £  L^}  is  indeed  an  or¬ 
thonormal  set  of  ket  vectors. 


Lemma  5  IfuJ,ui'  £  Lat,  then  {ui'  \  u)  =  8^'^. 


Proof: 


Q.E.D. 


(w'  I  u)  =  (wj  I  a;i)(u;2  |  0^2)  •  •  •  (u;'„  1  u;„) 


=  (5, 


=  8. 


The  matrix  representation  for  |  is  the  usual  2”  x  1  tensor  product  column  matrix  in  which 
each  appropriately  ordered  entry  corresponds  to  the  superposition  amplitude  for  the  associated 
basis  vector.  The  representation  for  the  bra  vector  ('J'|  is  the  analogously  defined  1x2”  row 
matrix. 


The  desired  normalization  for  is  guaranteed  by  the  next  lemma. 

Lemma  6  |5')  is  normalized. 

Proof:  Using  the  facts  that  each  variable’s  truth  state  is  normalized  and  that  scalar  prod¬ 
ucts  are  defined  only  for  variables  with  identical  subscripts,  we  have 


($!$)  =  (01  I  0l)  (02  I  02)  •  •  •  {On  I  Qn)  =  iJ-J,  =  1- 

n  times 
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Q.E.D. 


In  order  to  illustrate  this  construction,  let  variable  j  be  the  above  example  assertion  with 
truth  state  and  let  variable  k  be  the  assertion  “tomorrow  will  be  sunny”  with  truth  state 

Then  the  tensor  product  truth  state  for  this  system  is 


TT 

(^cos-l+)fc  +  sin^|-)it 


(cos^|+). 

“TT  ^(vi\ 

cos  -  cos  -  1+)^  |+)fc  +  cos  -  sin  -  |+)j-  l-)fc  + 


sin  -  cos  -  I  |+)fc  -r  sin  ^  sin  -  | -)j  \ -) ^ 


6  3 

3,,,,  v/3,.,, 

-J-  l+)i  |d-)fc  +  7  |->fc  +  7 \-)j  l+)fc  +  -J-  l-)j  |-)fe 


4 

4 


^  3,  ,  1,  ,  v^i  , 

•+)  +  7  |d — )  +  7  I — h)  +  —  I  )  , 


where  we  have  simplified  the  notation  by  defining  1+)^  1+);^.  =  |t+),  etc.  Observe  that  since 
n  =  2,  then  \^jk)  is  a  2^  =  4  dimensional  truth  state  vector  expressed  as  a  superposition  of 
basis  truth  states  given  by  the  set 


Sjk  =  {!++) ,  |h — )  >  I — ^-)  >  I — )} 

which  corresponds  to  all  possible  combinations  of  “true/false”  values  for  assertions  j  and  k. 
As  above,  when  u)  =  ojjUk  —  4-+,  then  h(uJj)  =  cos^,  h,{u;k)  =  cos|  so  that  /  (w)  = 
h{uj)h{LJk)  =  cos  I  cos -I,  etc.  Choosing  the  basis  state  order  given  in  set  Sjk  as  the  ma¬ 
trix  order,  then  1'5'jfc)  can  be  represented  by  the  column  matrix 


/  f  \ 


= 


V  #  / 


and  similarly  for  the  row  matrix  representation  for 

The  square  of  the  amplitude  for  s  €  Sjk  is  the  probability  for  the  conjunction  of  the  truth 
valued  assertions  associated  with  s.  For  example,  ~  probability  for  |-l — ),  so 

that  ^  is  the  probability  for  the  conjunction  of  assertion  j  when  it  is  “true”  with  assertion  k 
when  it  is  “false”  (i.e.,  “it  is  true  that  it  will  rain  tonight”  and  “it  is  false  that  tomorrow  will  be 
sunny”).  Also,  notice  that  the  sum  of  the  squares  of  any  subset  of  the  amplitudes  for  is 
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the  probability  for  the  disjunction  of  the  conjunction  of  the  associated  truth  valued  assertions. 
For  example,  consider  the  assertions  associated  with  the  basis  states  |H — )  and  | — r).  Then 

(l)  (3)  ~  I  probability  that  “it  is  true  that  it  will  rain  tonight  and  it  is  false  that 

tomorrow  will  be  sunny”  or  “it  is  false  that  it  will  rain  tonight  and  it  is  true  that  tomorrow  will 
be  sunny.”  Finally,  since  the  Sjk  basis  vectors  form  an  orthonormal  set  (i.e.,  {Ip  \  qr)  =  1  when 
I  =  q  and  p  =  r  and  is  0  otherwise,  where  l,p,q,r  G  {+,  — }),  it  is  readily  verified  that  the  last 
lemma  holds.  In  particular. 


As  seen  from  these  considerations,  the  guaranteed  normalization  for  a  truth  state  vector  is 
simply  an  affirmation  of  the  fact  that  the  “event”  described  by  the  disjunction  of  the  conjunction 
of  all  possible  truth  evaluations  (as  defined  by  the  associated  basis  vector  set)  for  the  logical 
variable  assertions  for  an  inference  system  is  the  certain  event  (i.e.,  has  unit  probability). 


2.4  THE  INFLUENCE  OPERATOR  FOR  A  LOGICAL  VARIABLE  AND 
THE  ROTATION  OF  ITS  TRUTH  STATE 

In  a  network  model  for  an  inference  S5^tem  with  influences  between  variables,  the  influence 
of  the  logical  variable  upon  the  logical  variable  is  represented  by  an  arc  from  the  node 
to  the  node  of  the  underlying  directed  graph.  In  this  case  the  node  is  called  the  parent 
node  and  the  node  is  called  the  child  node  and  we  refer  to  them  both  as  a  parent  —  child 
pair.  In  a  Dirac  network  the  influence  of  the  parent  upon  the  child  is  modelled  by  a  rotation 
of  the  truth  state  ket  vector  for  the  child  through  an  angle  ajk  defined  for  the  pair.  This 
rotation  angle  can  be  of  constant  value  or  can  be  functionally  dependent  upon  the  truth  state 
of  the  parent  (this  functional  dependence  capability  provides  a  modelling  flexibility  that  is  not 
typically  available  to  conventional  inference  systems).  Here  the  subscripts  denote  that  parent 
j  influences  child  k  and  we  say  that  “variable  j  exerts  an  ajk  influence  upon  variable  k''  Note 
that  a  variable  j  can  influence  itself  via  an  ajj  influence.  Such  an  influence  is  called  a  self 
influence  and  appears  as  a  directed  loop  on  node  j  in  the  associated  network  model  (clearly, 
this  is  a  cycle).  Also,  if  a  variable  is  not  influenced  by  another,  then  there  is  no  such  rotation 
and  the  variable  is  said  to  be  independent. 

The  influence  described  above  is  achieved  algebraically  through  application  of  the  exponen¬ 
tial  rotation  operator  given  by 

4fc(a,fc)=e'W 

to  the  truth  state  for  variable  fc,  where  dk  is  the  influence  operator  for  variable  k  and  i  = 
\/^.  The  influence  operator  for  variable  k  has  eigenvectors  |x±)fc  which  are  special  linear 
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superpositions  of  the  basis  ket  vectors  l±}fc-  In  particular, 

<Tfc  |x±)fc  =  ±  |x±)fc , 

where 

lx±)fc  =  ^[l+)fc±i|->fc]- 

The  next  two  lemmas  are  easily  verified  from  the  definition  of  |Xi;)j  stated  without 

proof. 

Lemma  7 


and 


Lemma  8 


and 


j  (x±l  X±)j  I 
i  (x±|  XrF)j  -  0. 

l+)j  =  ^  (|x+)j  T-  lx-)j) 

-) .  =  _  l^_) .) . 


From  this,  it  can  easily 
the  |±)j.  basis  set: 


be  shown  that  dfc  has  the  following  useful  matrix  representation  in 

^  fc(+l*5’fc|+)fc  fc  (+1 ‘^■fc  |-)fc  A 


Lemma  9  The  influence  operator  for  a  logical  variable  is  Hermitean. 


Proof: 


Thus,  (Tfc  =  a-\.  Q.E.D. 

The  result  of  the  action  of  a  rotation  operator  upon  a  child’s  truth  state  ket  vector  is  defined 
by  the  following  theorem. 

Theorem  10  Rjk  (ajk)  |  ^k)  =1  -  cijfc)  ■ 
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Proof:  First  expand  the  operator  in  a  power  series: 


n=0 


h  (2n)! 


°o  A  >2n+l 

(lOljkCTk) 

h  (2»+')' 


Clearly,  =  (-I)"  and  =  (-l)”i-  Also,  using  the  above  matrix  representation  for 

CTfc,  it  is  easily  verified  that  =  U,  and  =  ifcdfc  =  dfc  ,  where  ifc  is  the  identity 

operator  for  variable  k.  Then 


Rjk  {(^jk)  —  Ifc 


71=0 


(-l)"Kfc) 

(2n)! 


2n  oo 


7^=0 


(2n-f  1)! 


Thus, 


=  ifc  cos Ojfc  +  idfc  sin  ajk. 

Rjkic^jk)  I  i9fc)  =  (ifccosajfc  +  idfcsinojfc)  |  0fc). 


But, 

(IfcCosojfc +  icrfcSinajfc)  |  dk)  = 


cos  Ojfc  0 
0  cos  ajk 


0 


sin 


—  sin  ajk  0 


_  I  cos  ajk  s\n  ajk 
—  sinoj-fc  cos  ajk 


cos  9k 
sin^fc 


_  !  cos  9k  cos  ajk  4-  sin  9k  sin  a  jk 
sin  9k  cos  ajk  —  cos  9k  sin  ajk 


cos(6»fc  -  ajk)  i  _ 


9  k  O^jk)' 


Hence, 

Q.E.D. 


sin  {9k  -  ajk) 

Rjk  (oijk)  1  9k)  =1  9k  -  ajk) 


cos  9  k 
sin  9  k 


Thus,  the  action  of  the  rotation  operator  upon  the  truth  state  modifies  the  associated 
probability  amplitudes  by  changing  the  values  of  their  arguments  by  the  angle  ajk-  We  shall 
refer  to  the  initial  truth  state  of  a  child  node  before  it  is  rotated  due  to  the  influences  exerted 
by  any  of  its  parents  as  the  prior  —  state  of  the  child.  The  truth  state  of  a  child  after  it  has 
been  rotated  is  called  the  post  —  state  of  the  child.  Clearly,  the  state  |  9k)  in  the  last  theorem  is 
the  prior-state  for  the  node  and  |  9k -ajk)  is  its  post-state  (recall  from  the  previous  section, 
however,  that  we  require  0  <  6»fc  -  a^fc  <  f ).  Although  this  theorem  generally  eliminates  the 
need  to  use  a  matrix  representation  for  the  rotation  operator  when  processing  the  influences  in 
an  inference  system,  it  is  easy  to  see  from  the  proof  that 


Rjk  (®jfc) 


cosajfc  sinttjfc 
—  sin  ajk  cos  ajk 
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2.5  COMMUTATORS,  GLAUBER’S  THEOREM,  AND  PROPERTIES  OF  ROTATION 
OPERATORS 


Before  further  examination  of  the  properties  of  rotation  operators,  it  is  first  necessary  to 


introduce  the  commutator 


X,  Y  for  operators  X  and  Y  defined  by: 


X,y  =XY-YX. 


If  A,Fj  =  0,  then  operators  X  and  Y  are  said  to  commute  and  the  order  in  which  the 
operators  are  applied  to  a  system  ket  vector  is  irrelevant.  Obviously,  every  operator  commutes 
with  itself.  Truth  and  influence  operators  which  act  upon  distinct  ket  vectors  also  commute. 
The  following  lemma  summarizes  these  facts  for  the  truth  and  influence  operators  and  is  stated 
without  proof. 

Lemma  11  The  truth  and  influence  operators  obey  the  following  commutation  relations  ; 

(a)  {fj,fk]  =  [dj,(7fc]  =  0, 


and 

(6)  [fj,dk\  =  0,i  #  k. 


In  order  to  evaluate  the  commutator  it  is  first  necessary  to  introduce  and  define 

the  permutation  operator  itj  for  the  logical  variable.  We  do  this  in  terms  of  its  matrix 
representation  given  by 


0  1 
1  0 


It  is  easy  to  see  from  this  why  ■kj  is  called  the  permutation  operator:  when  kj  acts  upon  a  truth 
state,  it  interchanges  the  probability  amplitudes  associated  with  the  truth  eigenkets  |±).  More 
precisely, 


kj  \ej)  = 


0  1 
1  0 


cos  9j 
sin  6j 


sin  9j 
cos  9j 


Lemma  12  kj  is  Hermitean. 


Proof:  Obvious  from  the  matrix  representation  for  kj.  Q.E.D. 

It  is  now  an  easy  matter  to  evaluate  the  commutator  [fj,kj]. 
Lemma  13 


[fj,aj]  =  -2ikj. 
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Proof: 


=  —2iTtj. 


Q.E.D. 

The  next  lemma  provides  an  interesting  result  that  will  be  useful  to  our  development  below. 
Lemma  14 


Proof: 


(tTj)  =  Atj. 


=  2sm6jCos9j 
=  Atj. 


Q.E.D. 


Also  required  for  our  discussion  of  rotation  operators  is  Glauber’s  theorem[2]  which  we  also 
state  without  proof.  This  theorem  defines  a  useful  algebraic  relationship  for  operator  valued 
exponential  functions  in  terms  of  the  commutation  properties  of  the  operators. 

Theorem  15  (Glauber’s  Theorem)  //j^A’,Fj  =0,  then 

^  gVgX  ^  gX+y 

Now  let  us  turn  our  attention  to  the  rotation  operator.  An  arbitrary  operator  O  is  unitary 
when  &0  =  OO^  =  i.  The  unitarity  of  the  rotation  operator  is  easily  proven  in  the  following 
lemma. 

Lemma  16  Rjk  (ajfc)  is  unitary. 
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Proof:  Because  dk  is  Hermitean,  we  can  write 

Since  [ak,d'k]  =  0,  Glauber’s  theorem  applies  so  that 

^k  M  4-fe  (“ifc)  =  =  e°‘  =  ifc. 

Similarly  for  Rjk  (ajk)  R^jk  i^jk)  ■  Q-E-D. 

The  unitarity  of  the  rotation  operator  is  important  because  it  ensures  the  normalization  of 
the  post-state  that  results  from  application  of  a  rotation  operator  to  the  associated  (normalized) 
prior-state.  This  guarantee  is  provided  by  the  next  lemma. 

Lemma  17  {6k  \  R^k  i^jk)  Rjk  (ajfc)  |  ^k)  =  1- 

Proof:  Since 


Rjkiajk)  \dk)  =  \6k  -  ajk)  =cos(6>fc  -  ajk)  \+)^.  +  sin  {6k  -ajk)  |-)fc, 

then 

{6k  Ojfe  I  dk  ^jk)  ~  *^0®  (^fc  ^jk)  "I"  ®ill  {^k  ^jk)  ~  1- 

Therefore, 

{6k  ~  0^jk\  ^k  ~  ^jk)  ~  i^k  I  ^jk  i^jk)  ^jk  {(^jk)  \  ^k)  —  1- 

Q.E.D. 

The  order  in  which  rotation  operators  are  applied  to  truth  states  can  be  important  when 
the  rotation  angles  are  functionally  dependent  upon  the  associated  parent  truth  states.  In  such 
cases,  order  indices  can  be  used  to  impose  the  necessary  order  of  their  application.  Specifically, 
order  indices  m  and  n  are  counting  numbers  appearing  as  superscripts  on  rotation  operators 
R^  (aij)  and  (at;),  respectively,  which  indicate  that  (q,j)  must  be  applied  to  \6j)  before 
{oLki)  can  be  applied  to  \6i)  when  m  <  n.  U  m  —  n,  then  the  order  of  their  application  is 
unimportant.  Consequently,  the  following  commutation  relations  hold  when  order  indices  are 
used: 

Lemma  18 


(^ij) ,  Rki  («fcj)]  =  0,  m  =  n, 

lR^(aij),R^i(aki)]  y^O, m^^n, 
and  similarly  for  the  associated  adjoint  operators. 

Proof:  The  first  relation  is  a  direct  consequence  of  the  fact  that  the  influence  operator  is 
Hermitean  and  that  [dj,  d;]  =  0.  The  second  follows  from  the  fact  that  the  application  order  is 
important  when  order  indices  are  different.  Q.E.D. 
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Clearly,  when  m  =  n  for  every  m  and  n,  i.e.  when  the  order  of  application  for  any  rotation 
operator  is  unimportant,  the  index  is  superfluous  and  should  not  be  used  (e.g.,  when  all  the 
rotation  angles  are  constant).  However,  when  order  is  important  (e.g.,  when  some  of  the  rotation 
angles  are  functionally  dependent  upon  their  parent  truth  states),  then  it  is  required  that  every 
rotation  operator  used  to  specify  a  truth  state  be  superscribed  with  an  order  index  and  that 
the  operators  be  properly  ordered.  A  truth  state  specification 

i®>  =  (“i-j.)  (“.-.i-.)  ■  ■  ■  SB.  (On*)  i»..>  •  •  ■ 


is  properly  ordered  if  rim  ^  ^m-i  ^  ‘  ^  ^i-  For  sake  of  simplicity  the  following  notational 

convention  is  assumed:  truth  state  specifications  are  implicitly  properly  ordered  when  order 
indices  are  omitted  from  the  associated  rotation  operators.  Note  that  this  convention  also 
subsumes  all  cases  where  rotation  operator  application  order  is  unimportant.  This  notation 
is  important  for  the  formal  algebraic  specification  of  the  general  truth  state  for  an  inference 
system  which  possesses  many  influences  between  its  logical  variables. 


Let  J  be  the  index  set  for  parent  variables  which  influence  the  child  variable  k  so  that 
Uj^jRjkiajk)  is  the  associated  properly  ordered  string  of  rotation  operators  which  act  upon 
variable  k's  prior  truth  state  and  their  cumulative  rotation  of  the  prior  truth  state. 

The  following  theorem  gives  the  post  truth  state  for  a  child  variable  that  results  from  the 
cumulative  influences  of  multiple  parents. 

Theorem  19  Uj^jRjk  (ojfe)  |  Ok)  =|  Ok  - 

Proof:  Since  [&k,drk]  —  0,  Glauber’s  theorem  applies  so  that 

UjsjRjk  {ajk)  I  Ok)  =  e  |  Ok)  =|  Ok  -  Ej^J^jk)- 


Q.E.D. 


As  mentioned  above,  it  is  required  that  0  <  O'f.  =  Ok  —  Ej^j^jk  <  f  •  Thus,  the  following 
door-stop  rule  must  be  imposed  upon  truth  vector  post-states: 


0,  when  Ok  -  Ej^j  <^jk  <  0 
f ,  when  Ok  -  Ej&j  <^jk  >  f  • 
Ok  -  Ej€J  Oijk.  otherwise 


It  is  important  to  note  that  if  \xk)  =  ^jejRjk  {oijk)  then 

(Xfel  ^k  iXfc)  —  {^k\  ^j€J^]k  {^jk)^k^j£jRjk  {(^jk)  \0k) 

is  the  expected  truth  value  for  the  rotated  truth  state.  Therefore,  in  general,  (f  fc)  can  represent 
either  the  expected  truth  value  for  an  independent  variable  or  for  a  rotated  truth  state.  The 
context  in  which  the  notation  (ffc)  is  used  will  make  it  clear  whether  one  or  the  other  or  both 
of  the  possibilities  apply. 
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As  an  illustration  of  these  results,  consider  the  truth  state  given  by 

Note  that  since  the  rotation  angles  are  constant,  the  order  of  application  of  the  rotation  oper¬ 
ators  is  unimportant,  the  associated  rotation  operator  commutator  vanishes,  and  order  indices 
can  be  omitted.  Consequently,  we  can  interchange  the  order  of  the  operators  in  the  last  equa¬ 
tion  and  not  effect  the  final  result.  Also,  the  last  theorem  enables  us  to  rewrite  this  equation 
as 


Here,  the  influence  of  the  variable  is  to  increase  the  “falseness”  of  the  variable,  whereas 
that  of  the  variable  is  to  increase  the  “trueness”  of  the  variable.  The  combined  effect 
of  the  two  influences  is  to  generally  increase  the  “trueness”  of  the  variable’s  post-state. 
Observe  that  the  truth  value  of  the  prior-state  of  the  variable  is  completely  uncertain 
because  9k  =  The  influences  induced  by  the  and  Z**  variables  via  the  rotation  operators 
remove  this  uncertainty  in  the  post-state  for  the  variable.  Clearly,  in  this  case  it  is  not 
necessary  to  employ  the  door-stop  rule  because  the  angle  associated  with  the  post-state  truth 
vector  lies  within  the  required  [O,  §]  interval. 

In  order  to  illustrate  order  indices  and  their  utility  for  imposing  order,  as  well  as  controlling 
cyclic  influences  (including  self  influences),  consider  the  properly  ordered  truth  state  defined 

by 

(2-4) 

~  ^km  i^km)  ^k  i^lk)  ^kl  i^kl)  \9j)  \9k  —  Oijk)  l^i)  \9m) 

~  i^km)  Rfk  i^lk)  l^fc  ~  ^jk)  \9l  —  O-kl)  \9m) 

~  i^km)  \9j)  \9k  Cljfc  ttifc)  \9l  Ockl)  \9rn) 

—  \9j)  \9k  ^jk  ®/fc)  \9l  Q^fcz)  \9m  ^km)  ■ 

Here  variable  j  is  independent,  truth  states  |0fc)  and  \9i)  cyclically  influence  one  another,  and 
truth  state  \9k)  influences  state  l^m)-  Although  the  cyclic  influence  is  for  a  single  cycle,  it 
is  easy  to  see  that  multiple  cycles  merely  require  the  properly  ordered  insertion  of  additional 
appropriately  indexed  rotation  operators  in  the  truth  state  specification. 

Observe  that  if  all  the  rotation  angles  are  constant  for  a  truth  state  specification  (and 
therefore  independent  of  the  associated  parent  truth  states),  then  the  order  index  is  unnecessary 
because  the  same  truth  state  is  obtained  regardless  of  the  order  in  which  the  rotation  operators 
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are  applied.  However  -  as  mentioned  above  -  this  is  generally  not  the  case  when  rotation  angles 
are  functions  of  their  associated  parent  truth  states.  For  instance,  suppose  in  our  example  that 


ajk  =  0.259j, 
Otkl  =  O.lO^fc, 

tt/fc  =  Gi, 
and 

a  km  =  0.500k.  , 


(2-5) 


In  this  case,  the  truth  state  is  given  by 

\^)  =  lOj)  lOk  -  D.250j  -  [0i  -  O.lO(0fc  -  0.250,)])  • 

10/  -  O.lO(0;fc  -  0.250,))  • 

\em  -  O.5O{0fc  -  0.250,-  -  [0/  -  O.lO(0fc  -  0.250,)]}) . 

The  ability  to  specify  the  order  of  influences  via  order  indices  is  an  especially  important  and 
necessary  capability  when  rotation  angles  are  functions  of  their  associated  parent  truth  states 
because  different  rotation  operator  application  orders  will  generally  yield  different  truth  states. 
As  an  illustration  of  this,  suppose  the  rotation  operator  order  indices  2  and  3  are  interchanged 
in  our  above  specification  for  |5').  This  produces  a  truth  state  given  by 

1^-)  =  |0,)  0.250^-0,)  10/ -O.lO(0fc- 0.250,- -0/))-  (2-6) 

|0^  -  O.5O(0fc  -  0.250,-  -  0/)) 

which  is  obviously  different  than  that  of  the  original  specification  (note  however  that  if  the 
rotation  angles  are  constants,  identical  truth  states  are  obtained). 

Before  closing  this  section,  we  introduce  the  ordering  operator  5(r[  ].  This  operator  is  simply 
a  formal  mathematical  convenience  which  can  be  used  to  explicitly  denote  that  the  string  of 
rotation  operators  enclosed  by  its  brackets  is  properly  ordered  according  to  some  ordering 
scheme  a  (e.g.,  a  digraph  -with  vertices  representing  variables  and  numbered  arcs  representing 
the  order  of  influences).  If  there  is  only  one  ordering  scheme  under  consideration,  then  the 
subscript  a  is  omitted.  Thus  9(r[  ]  is  useful  for  simplifying  the  specification  of  a  truth  state 
(especially  during  its  developmental  phase)  by;  (1)  enabling  properly  ordered  strings  of  rotation 
operators  to  be  written  in  any  order;  and  (2)  providing  a  simple  notation  for  distinguishing 
between  diflFerent  orderings  for  a  string  of  rotation  operators  (the  number  of  such  orderings  has 
been  studied  by  Fisher  [5]). 

2.6  THE  TRUTH  STATE  VECTOR  FOR  A  GENERAL  INFERENCE  SYSTEM 

When  arbitrary  influences  are  involved  in  an  inference  system,  then  the  associated  system’s 
truth  state  vector  assumes  the  general  form 

1  ^)  =  d[n]  1 0), 
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where  |  0)  is  the  tensor  product  of  all  the  independent  logical  variables  and  prior  truth  state 
child  variables  in  the  Dirac  network  and  it  is  the  string  of  rotation  operators,  one  operator  for 
each  parent-child  pair  in  the  system.  We  shall  refer  to  such  a  state  vector  as  a  general  truth  state 
and  the  system  it  represents  as  a  general  inference  system.  As  evidenced  by  the  illustration  in 
the  last  section,  a  general  truth  state  may  be  rewritten  as 

1^)  a[7^]|T)|E) 

=  |E)a[7^]|T), 

where  |  E)  is  the  tensor  product  of  the  truth  states  for  all  the  independent  variables  in  the 
S5^tem  and  |  T)  is  the  tensor  product  of  the  prior-states  for  all  the  child  variables  in  the 
system.  The  following  theorem  guarantees  the  normalization  of  a  general  truth  state. 

Theorem  20  Every  general  truth  state  is  normalized. 

Proof:  Since  |  0)  =[  T)  |  E),  then 

I  =  d[1i]  \  0) 

~  Rij  (^ij)  E/cl  {oi/cl)  •  •  •  Rmn  (oimn)  |  '^)  1  — )• 

Therefore, 

(W  I  '!')  =  (—  I  (T  I  (otmn)  '  ' '  EJj  {ocij)  Eij  (ctij)  •  •  •  Emn  (0!mn)  I  1  •— )• 

From  the  unitary  and  commutation  properties  of  the  rotation  operators  and  the  fact  that  truth 
states  for  logical  variables  are  normahzed,  we  obtain 

{^\  ^)  =  (E  I  (T  I  RL  {amn)  Emn  (cCmn)  ’  ’  '  E^  (q^)  Eij  {aij)  |  T)  |  E) 

=  (E  I  (T  I  i  •  •  •  i  I  T)  I  E) 

=  (E  I  (T  i  T)  I  E) 

=  (H  I  H)(T  I  T) 

=  1. 


Q.E.D. 


This  result  is  easily  verified  using  the  illustration  of  the  previous  section  (since  the  rotation 
angles  are  constant,  the  rotation  operator  ordering  doesn’t  matter): 


($1$) 


TT 

3i 


TT 


/  Ttt 


(6 

h 

36  fc 


TT  \  /  TT 


6i/\fc4| 

=  1-1-1  =  1. 
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2.7  TRUTH  MEASUREMENTS  FOR  GENERAL  INFERENCE  SUBSYSTEMS 

The  collection  of  truth  operators  for  the  logical  variables  of  a  general  inference  system  may 
be  used  to  construct  a  generalized  truth  operator  for  any  subset  of  logical  variables  in  the 
system.  In  particular,  let  N  be  the  set  of  indices  for  the  logical  variables  in  a  general  inference 
system  and  let  0  ^  S'  C  A  with  cardinahty  |S|  (clearly,  S  defines  an  inference  subsystem  with 
vector  space  dimension  Then  the  generalized  truth  operator  for  the  inference  subsystem 
with  variables  S  is 

As  the  following  theorem  and  corollary  show,  the  expected  truth  value  for  any  such  subsystem 
is  simply  the  average  of  the  expected  truth  values  for  the  variables  in  S. 

Theorem  21  If  \  ff)  is  the  general  truth  state  for  a  general  inference  system  with  a  subsystem 
defined  by  the  variable  index  set  S,  then 

'  1^1  i€5 

where  (fj)  is  the  expected  truth  value  for  the  logical  variable. 

Proof:  Let  N  index  the  variables  in  a  general  inference  system  and  let  |  5')  =|  77)  |  ^), 
where  |  g)  and  |  are  the  tensor  products  (denoted  by  ©) 

I  V)  =  ©i€W-S  I  Mi)  and  |  0  =  ©fce5  I  4)- 

Here  |  Pj  )  and  |  (pff)  represent  both  independent  variable  states  and  post-states  for  child 
variables.  Then 

{^\fs\^)  =  {g\{^\fs\0\  V) 

=  {ri  I  V){^  \fs\0 

-  (e  I  Ts  U) 

=  {®k£s{4>k  I)  {®keS  I  0fc)) 

=  ill  E  I  I  I  *))} 

=  iliE»,|7,i« 

Q.E.D. 
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Corollciry  22  —1  <  {Ts)  <  1- 

Proof:  Obviously,  the  maximum  value  for  {T$)  occurs  when  {fj)  =  1  for  every  j  €  S.  Then 

Similarly,  the  minimum  value  occurs  when  (fj)  =  —1  for  every  j  6  S.  Then 


jes 


|5| 


Q.E.D. 


Analogous  to  the  case  for  the  expected  truth  value  for  a  single  variable,  when  (Ts'^  >  0 
(<  0),  the  conjunction  of  the  assertions  of  variables  in  S  is  more  true  (false)  than  false  (true) 
and  when  =  0)  the  conjunction  is  precisely  as  true  as  it  is  false  and,  thus,  uncertain. 

We  quantify  the  truth  uncertainty  ATs  associated  with  variables  in  S  as  the  average  truth 
uncertainty  given  by 

Lemma  23  0  <  ATs  <  1. 

Proof:  The  maximum  ATs  value  occurs  when  Arj  =  1  for  every  j  G  S.  Then 

|5! 


AT  —  _L  1  =  =  1 

I  <71 


jes 


15| 


When  the  minimum  value  Atj  =  0  is  assumed  for  every  j  G  5,  then  ATs  =  0.  Q.E.D. 

These  results  can  be  seen  in  the  following  example  where  we  consider  the  general  truth  state 
used  above  with  N  =  {j,  l,k}.  If  5  =  then  |5|  =  2  so  that 


Ts=  Q)  (tj+ffc) 


and 


Ts)  = 


2  ) 


=  G){H 


+ 


cos 


h 

36 


—  sin 


I)]} 


1 
2 

0.421. 


-  )  {0.5000  +  0.3420} 
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Note  that  the  post-state  is  used  to  calculate  (ffc).  The  associated  truth  uncertainty  is 
AT5  =  Q  j  (At;  -r  Axfc) 


=  [0.8660  ^  0.9398] 

=  0.9029. 

2.8  THE  STATISTICAL  OPERATOR  AND  VON  NEUMANN  ENTROPY  FOR  AN 
INFERENCE  SYSTEM 

As  we  have  seen,  both  (fs'j  and  ATs  measure  the  “truthfulness”  of  assertions  defined  by 
S,  i.e.  the  truth  certainty  increases  as  (Ts^  il  (while  simultaneously  ATs  0)  and 

decreases  as  (Ts'^  — >  0  (while  simultaneously  ATs  — »  !)•  Even  though  (Ts'J  and  ATs  are 
logic-based  quantities,  using  them  to  characterize  the  truth  uncertainty  associated  with  Dirac 
networks  may  be  adequate  for  many  applications.  However,  since  the  theory  underlying  Dirac 
networks  is  physics-based,  an  associated  entropy  can  be  calculated  to  provide  an  alternative 
quantification  of  a  system’s  uncertainty.  Unlike  (Ts^  and  ATs,  entropy  measures  the  lack  of 
information  about  a  general  truth  state.  Note,  however,  that  its  value  says  nothing  about  the 
origin  of  this  indeterminacy. 

To  define  our  entropy  measure,  it  is  first  necessary  to  introduce  the  statistical  operator.  Let 
N  and  L]\f  be  as  defined  above.  If  0  ^  A  C  and  u>  €  L{,f,  then  define  zv  to  be  that  obtained 
from  u)  by  deleting  from  lj  all  Uj  such  that  j  €  {N  —  A)  and  define  La  as  the  set  of  all  such  w. 
Observe  that  \Ln\  >  \La\  and  that  {[  ro)  |  ru  €  La]  is  a  truth  basis  set  which  spans  the  2l^l 
dimensional  vector  subspace  associated  with  the  inference  subsystem  defined  by  A. 

Now  let 

\^a)=  XI  ^  I 

be  the  general  truth  state  for  the  subsystem  defined  by  A  with  probability  amplitude  i/{w)  = 
V  {ojjUJk  ■•■oji)  =  h  {ujj)  h  {uJk)  ■■•h  (cJi),  where  the  function  h  is  as  defined  above.  The  statistical 
operator  Pa  for  the  general  inference  subsystem  spanned  by  the  basis  set  specified  by  La  is 
defined  as 

Pa=  \  (^)  I  • 

The  trace  of  Pa  is  the  sum  Tr  {p^}  =  Y^&La  Pa  I^)  •  As  the  next  lemma  shows,  the  trace 
of  Pa  is  unity. 

Lemma  24  Tr{pjf]  =  1. 
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Proof: 


Tr{pA]= 

=  I  (^')  I 

■uj£La  t^'&La 

■c:!€La  w'&La 

=  Y. 

■cjSLa 

=  i^A  \^a)  =  1 


since  |'5'^)  is  normalized.  Q.E.D. 


We  shall  use  the  von  Neumann  entropy  to  provide  a  measure  of  the  “informational  un¬ 
certainty”  associated  with  a  general  truth  state.  The  von  Neumann  entropy  Ea  for  a  state 
consisting  of  logical  variables  indexed  by  the  set  A  is  defined  as 


EA  =  -KTr{pAlnpA}  =  -K 


where  k  is  a  constant  and  we  define  0  •  In  0  =  0.  As  the  following  results  show,  the  von  Neumann 
entropy  has  a  very  useful  additive  property  which  enables  the  straightforward  calculation  of  its 
value  for  multi-variable  systems. 

Lemma  25  If  \  if)  =|  ip)  \rj),  then  E^  =  E-^-r  Efj. 

Proof: 

E^,  =  -KTr-|p^lnp^| 

=  -K(Tr^^p^®p^lnp^-~p^®p^lnp^^'^ 

=  -K  (Tr^  \^p^lap^^Trr,  -f-  Tr^  Tr^  [pj,  Inp^}) 

=  -K  (Tr^^p^hip^^  +Trr,  |p^lnp^|) 

=  E^  -1-  Erf, 

where  ©  denotes  a  tensor  product  and  we  have  used  the  fact  that  the  trace  of  a  statistical 
operator  is  unity.  Q.E.D. 

Theorem  26  Let  N  index  the  logical  variables  in  a  general  inference  system  luith  ^  A  Q  N . 
Then,  Ea  = 

Proof:  This  is  a  direct  consequence  of  the  last  lemma  and  the  fact  that  the  general  truth 
state  I  ^a)  is  a  tensor  product  of  the  truth  states  for  every  variable  in  the  index  set  A.  Q.E.D. 

The  minimum  entropy  Ea^^^  =  0  occurs  when  (n/)  =  1  for  some  n/  €  La  (which  implies 
1/2  fca)  —  0  for  all  w  €  La  -  {ot'};  recall  that  0  •  InO  =  0),  i.e.,  when  |  =|  for  some 
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w'  €  La-  In  this  case  there  is  complete  knowledge  of  the  truth  values  for  the  system’s  variables 
and  general  truth  state  |  is  certain.  The  maximum  value  for  Ea  (i-e.,  the  maximum 
informational  uncertainty  for  |  occurs  when  it  is  equally  probable  that  |  5'^)  has  the  truth 
values  given  by  the  ket  vector  |  w)  for  every  zu  €  La,  i-e.,  when  v  {w)  =  -7^=  for  each  w  &  La- 

Then, 

=  Kln|z,^| . 


It  is  convenient  to  quantify  the  informational  uncertainty  associated  with  a  general  truth 
state  I  )  in  terms  of  the  system  uncertainty  Aa  defined  by 


Ea 

Ea 

■^max 


Ej 

Kin  IT.4I  ■ 


Lemma  27  0  <  Aa  <  1- 
Proof:  Obvious.  Q.E.D. 

Thus,  the  closer  the  value  of  A.4  is  to  0  (1) ,  the  more  certain  (uncertain)  the  information 
contained  in  the  system’s  general  truth  state  | 

As  an  illustration,  consider  the  general  truth  state  of  the  previous  section  indexed  by  = 
{j,  I,  k}-  Let  A  =  {j,  k}  so  that 


\^a)  = 


IT 


h 

36  fe 


=  cos  (I)  cos  (1^)  1+-!-)  -c  cos  (I)  sin  (g)  [+-) 


77 

Sin  I  I  cos 
.  o 


|)|-+)+sin(0sm(| 


where  -  for  the  sake  of  brevity  -we  have  suppressed  the  j  and  k  subscripts  in  the  basis  state 
expansion.  The  statistical  operator  for  this  state  is  given  by 


2  I 

36 


I  — h)  sin^  (  -  )  cos 
.  o 


)(-. 


-t-  -i-  — )  sin 


77 


sin 


36/  ' 


It  is  easily  verified  from  this  that 

Tr{PA  =  cos^(l)c<»i'(|)+cos^(l)sin^(|)- 
sm’(0cos^(|)+si, 


=  cos 


3 

2  /  777\ 


h 

36 
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The  von  Neumann  entropies  for  each  state  in  the  index  set  A  are  readily  calculated  as 
follows: 


Using  the  additive  property,  the  entropy  for  is  found  to  be  Ea  =  Ej  Ek  =  1.1958k. 
Since  \La\  =  2l"^i  =  2^  =  4,  then  Ea,^^  =  Kln4  =  1.3863k  so  that 


1.1958k 

1.3863k 


0.8626 


(contrast  this  value  for  with  that  for  ATg  calculated  above).  Notice  that  if  there  are 
no  rotation  operators  associated  with  the  state  defined  by  the  index  set  N,  then  variable  k 
is  independent  and  has  truth  state  f  The  entropy  for  this  state  therefore  assumes  the 
maximum  value 


Ek  =  -K  jcos^  In  cos^  ^ 


+  sm^ 


wlln 


sm 


)]}  =  ■ 


6931k 


=  Kln2  = 


In  this  case,  the  state  is  more  informationally  uncertain  than  that  of  the  previous  example. 
Specifically,  =  0.9056  >  0.8626. 


2.9  THE  INFLUENCE  ENERGY  OPERATOR  FOR  A  LOGICAL  VARIABLE  AND 
THE  TIME  EVOLUTION  OF  ITS  TRUTH  STATE 


When  the  truth  state  for  a  logical  variable  is  time  dependent,  then  the  Schrodinger  equation 
may  be  used  to  determine  an  influence  energy  operator  for  the  variable. 

Theorem  28  The  influence  energy  operator  for  a  logical  variable  with  truth  state  \0j)  is  given 
by  hj  =  Oj&j,  where  dj  is  the  influence  operator  and 


Proof:  The  equation  of  motion  for  |0j)  is  given  by  the  Schrodinger  equation 


,d\0j) 

dt 


—  hj  \  0j) , 


where  i  =  \/—l  (for  obvious  reasons  we  assume  that  Planck’s  constant  has  unit  value  and  take 
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liberty  to  refer  to  hj  as  an  energy  operator).  Since  |0j)  =  cos9j  |+)^.  +  sin^j  then 

Me^) 


dt 


-idj  sin  6j  I  +}j  -r  iOj  cos  6j  \ —)^ 


When  this  equation  is  expressed  in  matrix  form,  it  becomes 


d  ( cos  O-i 


dt  \  sin  6i 


=  id 
—  idj 

=  e, 


—  sin 
^  ^  cos  9  j 

'  0  -1 

1  0 

0  -i 

i  0 


(cos  9j 
sin  9j 

fcos9j\ 
1  sin^j  ) 


Rewriting  this  result  in  ket  -  operator  form  yields 

\^j) 

from  which  it  is  readily  seen  that  hj  =  djdj.  Q.E.D. 

Since  hj  is  proportional  to  the  influence  operator  aj  we  call  it  the  influence  energy  operator 
for  variable  j. 

Lemma  29  hj  is  Hermitean. 

Proof:  This  is  a  direct  consequence  of  the  fact  that  &j  is  Hermitean  and  9j  is  a  real  number. 

The  influence  energy  operator  for  a  logical  variable  is  important  because  it  defines  a  time 
evolution  operator  that  describes  the  influence  of  time  upon  its  associated  truth  state  (i.e.,  it 
describes  its  dynamical  behavior). 

Theorem  30  If  a  logical  variable’s  truth  state  at  time  tg  is  \9j{tQ)),  then  its  truth  state  at  time 
ti  >  to  is 

\9j{h))  =  Ujih,to:9j)  \9jito)), 

where 

■Tt  /  n  \  r  ^  hidt  —i&j  f  ^  6jdt 

Uj{tuto;9j)  =  e  ^  =e 
is  the  time  evolution  operator  for  the  logical  variable. 

Proof:  Since 


d\9j^  _  -  I  , 


then 


Jtn 


d\9 

J\0Ato))  \^j 


) 
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so  that 


or 

Q.E.D. 
Corollary  31 


/•ti  . 

?j(to)  +  / 

Jto 


Proof:  Observe  that  since  hj  =  djd 


Uj  (ti ,  to!  ^i)  —  ^ 


has  the  same  exponential  form  as  a  rotation  operator  with  rotation  angle  -  9dt.  Conse¬ 
quently,  the  proof  is  analogous  to  that  of  theorem  10.  Q.E.D. 


Lemma  32  is  unitary. 

Proof:  This  result  follows  from  the  fact  that  aj  is  Hermitean,  =  0,  and  Glauber’s 

theorem.  Q.E.D. 

The  following  lemma  shows  that  a  truth  state  remains  normalized  during  its  time  evolution. 


Lemma  33  =  (0j(to)|  ^i(^o))  =  1- 

Proof:  This  result  follows  from  the  unitarity  of  the  time  evolution  operator.  Specifically, 
since 

|6»j(ti))  =  Uj{ti,to;  9j)  |6'j(to)) 

then 

so  that 

=  (0,(to)|  i  |0i(to)>  =  (^,(to)|  ej{to))  =  1. 


Q.E.D. 

There  is  an  obvious  imphcit  natural  time  order  that  governs  the  order  of  application  of  mul¬ 
tiple  time  evolution  operators  with  the  same  truth  variable  subscript  to  the  associated  truth 
state.  In  particular,  if  t2  >  h,  then  Uj{ti,to',9j)  must  be  applied  to  \9j)  before  Uj{t3,t2-,9j). 
When  this  is  the  case  we  say  that  Uj{t3,t2\9'j)Uj{ti,to-,9j)  \9j{to))  is  a  properly  time  ordered 
evolution  (note  from  this  that  the  rates  of  evolutionary  change  can  be  different  for  each  time 
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evolution  interval).  Consequently,  we  require  (and  always  assume)  that  all  time  evolution  spec¬ 
ifications  for  a  variable  are  properly  time  ordered  (e.g.,  the  previous  specification  is  undefined 
if  to  <t2  <  ti). 

When  both  time  evolution  operators  and  rotation  operators  are  used  to  specify  a  truth 
state,  then  one  of  two  (application  dependent)  cases  are  possible:  (1)  the  time  evolutions  and 
rotations  are  independent  of  one  another;  or  (2)  the  order  in  which  they  are  applied  to  the 
truth  state  is  important.  When  the  former  case  prevails,  then  the  truth  state  obtained  from  a 
time  evolved  child  truth  state  is  the  same,  regardless  of  the  order  in  which  the  associated  time 
evolution  and  rotation  operators  are  applied  to  the  prior-state  of  the  child.  The  essence  of  this 
case  is  described  formally  in  the  following  lemma  and  its  corollary  (however  -  as  required  above 
-  the  time  evolution  operators  must  still  be  properly  time  ordered).  Clearly,  the  cumulative 
eflFect  of  rotations  and  time  evolutions  upon  a  truth  state  remain  constrained  by  the  door-stop 
rule. 

Lemma  34  ^Uj{ti,to;dj),  R^fiaki)^  =  0  and  similarly  for  the  associated  operator  adjoints. 

Proof:  This  is  a  direct  consequence  of  Glauber’s  theorem,  the  fact  that  the  influence 
operator  is  Hermitean,  and  [^j,d'i]  =  0.  Q.E.D. 

Corollary  35  j^C/j(ti,to;^j)»'9cr['^]]  =  0. 

Proof:  This  follows  directly  from  the  last  lemma  and  the  definition  of  the  ordering  operator. 

Q.E.D. 

When  case  (2)  prevails,  then  order  indices  can  be  used  to  specify  not  only  the  application 
order  for  rotation  operators,  but  also  the  application  order  in  which  time  evolution  operators 
are  to  be  applied  to  a  truth  state  relative  to  the  application  order  of  the  rotation  operators.  As 
required,  the  time  evolution  operators  must  still  be  properly  time  ordered  relative  to  themselves. 
The  following  lemma  and  corollary  formalize  this  case. 

Lemma  36 

[cj”(ti,to;0j),Rfci(afc/)]  =  0,m  =  n 

and 

[uTituto;9j),RM{aki)]  ^0,m:^n. 

Proof:  The  result  for  m  =  n  follows  from  Glauber’s  theorem  and  [&j,ari]  =  0.  The  result 
for  m  ^  n  follows  from  the  definition  of  order  index.  Q.E.D. 

For  notational  convenience,  we  shall  adopt  the  convention  that  ^  represents  either  a  string 
of  rotation  operators,  a  string  of  time  evolution  operators,  or  a  combined  string  of  both  rotation 
and  time  evolution  operators.  Furthermore,  we  shall  also  endow  the  ordering  operator  with  the 
additional  property  that  d„[^]  is  an  ordering  of  ^  according  to  a  scheme  cr  which  ensures  that 
the  time  evolution  operators  are  also  properly  time  ordered.  The  next  result  is  the  obvious 
consequence  of  the  unitarity  of  the  rotation  and  time  evolution  operators  and  is  stated  without 
proof. 
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Lemma  37  dc[&\  is  unitary. 

Before  closing  this  section,  we  note  the  following  useful  property  for  time  evolution  operators. 
Lemma  38  For  k  e  {1, 2,  •  •  •  ,n},  let  tk  >  tfc-i-  Then 

A  -fc 

fc=iA-i 

Proof:  This  is  a  consequence  of  the  definition  of  the  time  evolution  operator,  the  fact  that 
the  influence  operator  commutes  with  itself,  and  Glauber’s  theorem.  Q.E.D. 


^k=.imk.tk-x\e))\ej)  = 


2.10  THE  INFLUENCE  ENERGY  STATES  FOR  A  LOGICAL  VARIABLE 


The  influence  energy  operator  provides  for  an  alternative  description  of  a  logical  variable’s 
truth  state  in  terms  of  its  associated  influence  energy  basis  eigenvectors  instead  of  the  usual 
|±)  truth  basis  vectors.  This  is  easily  seen  by  recalling  that  the  influence  operator  aj  for  a 
variable  j  has  eigenvectors  |x±)j  which  are  linear  superpositipns  of  the  basis  ket  vectors  |±)^.. 
Specifically, 

^3  lx±)j  =  ±  |X±)^- , 

where 


Since  h,  =  dj&j,  then 


k  \x±)j  =  \x±}j  ■ 


Thus,  |Xd;)j  the  influence  energy  eigenstates  for  variable  j  and  ±9j  are  the  associated 
influence  energy  eigenvalues. 


Lemma  8  can  be  used  to  show  that  the  truth  state  for  a  variable  can  be  represented  in  terms 
of  its  influence  energy  eigenstates. 

Theorem  39 

Proof:  Substitution  of  the  results  of  lemma  8  for  |±)j.  into  \9j)  =  cos9j  |4-)j-  +  sm9j  |— )^- 
yields  upon  rearrangement 

\9j)  =  ^  j(cos6»j -isin6>j)  \x+)j  +  {cos9j  +  isin9j)  |X-)j}  • 

Recognizing  that  cos9j  ±ism9j  =  completes  the  proof.  Q.E.D. 
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Note  that  \6j)  remains  normalized  in  this  representation.  Also,  observe  that  since 

-  -fc  .  >  —iaj  f**  o’idt 

-i(±l)  //*=  B)dt 

=  e  •'‘*-1  \x±)j 

(to  see  this,  expand  tfc-i;  ^j)  in  a  Taylor  series  and  use  the  fact  that  dj  |x±)j-  =  (±1)  |x±)j)i 
then 

SO  that  the  time  evolution  of  a  truth  state  in  its  influence  energy  representation  is  conveniently 
recorded  as  simple  equal  and  opposite  phase  changes  for  each  of  its  influence  energy  basis  states. 
Of  course,  this  state  can  be  transformed  into  its  \.±)j  representation  via  the  associated  identity 
substitutions  for  |x±)j- 

2.11  THE  EQUATION  OF  MOTION  FOR  THE  EXPECTED  TRUTH  VALUE 

The  influence  energy  operator  is  important  not  only  for  describing  the  dynamics  of  Dirac 
network  truth  states,  but  also  for  providing  equations  of  motion  for  the  associated  expected 
truth  values.  In  this  section  we  will  derive  the  equation  of  motion  for  a  logical  variable’s 
expected  truth  value  and  briefly  discuss  its  significance  and  utility. 

To  derive  the  equation  of  motion  for  the  expected  truth  value  for  the  logical  variable  it 
is  first  necessary  to  obtain  the  time  derivative  of  (r j) : 

d  {fj)  ^  d{ej\fj  |6>j) 
dt  dt 

~  dt  dt 


dt 


i {ej\hjfj  \ej)  +  {0j\ ^  \ej)  - i {9j\  ffh,  \9j) 
i  (^jl  {^T'j  -  Tjhj)  \9j)  +  {9j\  ^  \9j) 

dTj 


=  -i  ^i]  \^j)  +  (^il  -J  \^o) 


or 


(2-7) 
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where  we  have  used  the  facts  that  hj  is  Hermitean, 

d{Bj 


dt 

dm 

dt 


—  i{Oj\hj, 

=  —ihj  , 


dt 


and  hj  =  Bj&j.  Equation  (2-7)  can  be  further  simplified  by  noting  that 
lemma  13  to  obtain 


0  and  using 


d  (fj)  _ 


dt 

or  by  appljdng  lemma  14  to  rewrite  this  as 


d  (fj)  _ 


dt 


—  29 j  (tTj) 


-29 j  ■  Atj 


(2-8) 


=  —49j  sin  9j  cos  9j . 


Equation  (2-8)  is  interesting  because  it  not  only  relates  the  rate  of  change  of  the  expected 
truth  value  for  a  variable  to  its  truth  uncertainty,  but  also  indicates  whether  the  state  is 
becoming  more  true  or  more  false  as  it  changes  in  time.  In  particular,  it  demonstrates  that: 


1.  if  9j  =  0,  then  the  expected  truth  value  for  variable  j  is  a  constant  of  the  motion; 

2.  for  a  fixed  (nonzero)  9j  the  rate  of  change  of  {fj)  varies  directly  with  the  value  of  the 
truth  uncertainty;  and 

3.  when  9j  <  0  (>  0)  the  rate  is  positive  (negative)  and  the  variable’s  truth  state  is  moving 
in  time  towards  being  precisely  true  (false),  i.e.  towards  the  eigenstate  |+)j  (|— )j)- 

Integration  of  the  last  equation  yields  the  desired  equation  of  motion.  Multiplying  both 
.  sides  of  Equation  (2-8)  by  dt  gives 

d  (fj)  —  —4  •  dj9j  sin  9j  cos  9j 


which  upon  integration  5delds 


(T,(to)) 


sin  9j  cos  9jd9j 


=  {"^jito))  -  2  jsin^  9j{t)  -  sin^  0j(to)}  ,  (2-9) 

9j{t)  =  9j{to)  ■+■  f  9jdt  (2-10) 

Jto 

(clearly,  the  door-stop  rules  apply  to  9j{t)).  Equation  (2-9)  is  the  equation  of  motion  for 
the  expected  truth  value  for  a  variable.  It  is  interesting  to  note  that  the  term  enclosed  in  curly 
braces  in  Equation  (2-9)  is  the  difference  in  the  probabilities  at  times  t  and  to  that  the  assertion 


or 

where 
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associated  with  variable  j  is  false.  Observe  that  Equation  (2-9)  can  also  be  rewritten  as 

(rjit))  =  {fj{to))  +  2{cos2  djit)  -  cos2  0,.(to)} 

in  which  the  term  enclosed  in  curly  braces  is  the  difference  in  the  probabilities  at  times  t  and 
to  that  the  assertion  associated  with  variable  j  is  true.  Consequently,  the  equation  of  motion 
for  (fj)  is  (as  might  be  expected)  directly  related  to  the  temporal  changes  in  the  probability 
that  variable  j  is  true  or  is  false. 

Thus  -  when  given  6j  -  if  the  expected  truth  value  for  variable  j  and  9j  are  known  at  time 
to,  then  Equation  (2-9)  can  be  used  to  predict  the  expected  truth  value  for  variable  j  at  time 
t.  As  a  simple  illustration  of  this,  assume  that:  (i)  9j  has  a  constant  value  of  -1.5°  per  hour; 
and  (ii)  9j{to)  =  30°.  Then  from  Equation  (2-10)  we  have 

9j{t)  =  30°  +  (-1.5°/hr.)  Tdt 

Jto 

=  30  -  1.5  •  {t  -  to)  deg  . 


Since 

(fj(to))  =  cos2  30°  -  sin^  30°  =  0.50, 

then  -  using  Equation  (2-9)  -  we  have  for  the  associated  equation  of  motion  the  (time  dependent) 
expression  given  by 

(fj(t))  =  0.50-2{sin2[30-1.5-(t-to)]-0.25} 

=  1.00-2sin2[30- 1.5-(t-to)]. 


For  example,  if  t  —  to  =  5hrs,  then  (Tj(to  —  5))  —  0.7071. 


Before  concluding  this  section,  we  point  out  that  the  equation  of  motion  for  the  expected 
truth  value  (fs)  is  (essentially)  the  averaged  sum  of  the  equations  of  motion  for  each  variable 
in  the  set  S.  This  is  easily  seen  since 


^  J_ 

dt  |51  dt 

which  upon  integration  yields 

=  {Tsih))  +  ]|i  ^  {('i'j(O)  -  {rj{to))} 

=  (fsito))  “  jlj  ^>^^°)}  ■ 
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2.12  THE  TRUTH  AUTOCORRELATION  FUNCTION  AND  CONDITIONAL 
PROBABILITY 

The  correlation  between  two  truth  states  \6j)  and  for  a  logical  variable  is  given  by  the 
value  of  the  associated  truth  autocorrelation  function 

{epj)=cos{e'-9,). 

(This  expression  is  the  direct  consequence  of  the  orthonormality  of  the  truth  basis  ket  vectors) . 
Since  0  <  6j,6'j  <  then  0  <  9j)  <  1.  Thus,  the  closer  the  function’s  value  is  to  1 

(0)  the  more  (less)  similar  the  states  are  and  the  better  (worse)  their  correlation.  When  the 
value  of  the  function  is  1,  i.e.  0'-  =  dj,  (is  0,  i.e.  6j  -  Oj)  =  ±f )  the  truth  states  are  exactly 
correlated  (are  completely  uncorrelated).  Note  that  due  to  the  constraint  imposed  by  the  door¬ 
stop  rule,  this  latter  case  of  complete  uncorrelation  can  occur  only  when  =  |— )j 

and  \6j)  =  |0j)  =  or  vice  versa. 

The  truth  autocorrelation  function  can  be  especially  useful  for  analyzing  correlations  be¬ 
tween  the  truth  states  for  a  variable  in  the  following  cases: 

1.  the  correlation  between  a  variable’s  prior-state  and  post-state,  i.e. 

if  \e'j)  =  d,[1^  \ej)  ,then  (0' I  9j)  =  (^,-|  \9j) ; 

2.  the  correlation  between  a  variable’s  time  evolving  truth  state  at  different  times,  i.e. 

if  |6»j(ii))  =  Uj(fi,to;<9j)  i0j(to))  Uhen 
{9j{ti)\  9j(to))  =  {9j{to)\Ujiti,to-9j)  \9j{to)); 

and 

3.  the  correlation  between  a  variable’s  truth  state  and  its  resultant  state  after  a  combination 
of  rotations  and  time  evolutions,  i.e. 

if  l^')  =  \0j) ,  then  (0' I  9j}  =  {9j\  (4[^])^  l^,')  - 

This  notion  can  be  extended  to  define  the  truth  autocorrelation  function  for  two  general 
inference  systems 

m  =  d4gm 

and 

to  be 

(^'l  ^)  =  {Q\  (aA[gq)^(a.[g])|0). 
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Before  closing,  we  note  that  the  square  (modulus)  of  the  truth  autocorrelation  function 

\(ep,)f  =  cos\e'^-ej) 

may  be  interpreted  as  the  conditional  probability  p{0'j  \  6j),  i.e.  the  probability  that  variable 
j  is  in  truth  state  given  that  it  was  in  truth  state  |0j).  Similarly,  the  square  (modulus) 

|($'|  may  be  interpreted  as  the  conditional  probability  p( 4''  |  3')  that  the  inference  system 
is  in  truth  state  l^f')  given  that  it  was  in  truth  state 
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3  TRUTH  DISTRIBUTION  FUNCTIONS  AND  OPERATORS  FOR 
PROBABILISTIC  INFERENCE  SYSTEMS 


The  formalism  developed  in  this  section  is  an  additional  and  natural  feature  that  is  available 
for  use  by  Dirac  networks  and  is  not  generally  available  to  traditional  inference  systems.  It  can 
be  especially  valuable  for  physical  (e.g.  optical)  "implementations  of  Dirac  networks  in  which 
actual  measurements  are  made  upon  the  system. 


3.1  THE  TRUTH  DISTRIBUTION  FUNCTION  FOR  A  LOGICAL  VARIABLE 


In  addition  to  the  two  dimensional  truth  vector  space,  an  uncountable  set  of  position  ket 
(and  bra)  basis  vectors  (and  |^¥?j||^can  be  associated  with  the  logical  variable  in 

a  general  inference  system.  The  scalar  product  for  position  vectors  for  variable  j  is  given  by 

(ipj\  if'j)  =  6  {^pj  -  ip'^)  . 


Here,  S  (^(pj  —  (pj^  is  the  Dirac  delta  function  defined  by 


S  (<pj  -  ¥>' ) 


1  when  (pj  =  (p'j 
0  when  ipj  (p'j 


and  has  the  property  that  for  any  function  / 

j  f  ^  -  ‘p'j)  ^‘Pj  =  f  (p'j)  • 

Unless  otherwise  stated,  aU  upper  and  lower  integration  limits  are  assumed  to  be  +00  and  —00, 
respectively. 


A  normalized  position  probability  amplitude  ket  vector  well  as  its  dual  bra  vector 

can  also  be  assigned  to  the  variable  and  used  to  represent  the  probability  amplitude 
associated  with  some  property  v  of  its  truth  state  (e.g.  fj.).  The  ket  vector  can  be  projected 
into  the  position  space  of  the  -variable  by  forming  the  scalar  product  (jPj^  ‘‘I’j')-  product 

is  called  the  wave  function  for  the  -variable’s  truth  state  and  '^j)\  associated 

probability  distribution  function  for  that  truth  state,  i.e.  the  truth  distribution  function.  Thus, 
dipj  represents  the  probability  for  finding  the  property  v  for  the  truth  state  l^j)  in 
the  interval  dp>j. 
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The  closure  operator  for  the  variable  is  defined  as 

/  {‘Pjl  =  y- 

As  the  notation  suggests,  this  operator  is  an  identity  operator  comprised  of  projections  into  both 
the  position  ket  and  bra  vector  spaces.  It  is  especially  useful  for  producing  probability  distri¬ 
bution  functions  (and  moments)  when  applied  to  scalar  products  of  position  amplitude  vectors. 
This  utility  is  demonstrated  in  the  proof  for  the  next  theorem  which  provides  a  normalization 
guarantee  for  the  truth  distribution  function. 

Theorem  40  If  is  normalized,  then  so  is  (ipj 

Proof: 

1  =  (V’jI  i’j)  =  h  \'>Pj)  =  J  (^i|  ^j) 

=  J  I  I  'i’j)  =  /  1  ■ 

Q.E.D. 

We  also  define  two  operators  useful  for  describing  the  properties  of  truth  distribution  func¬ 
tions.  The  first  of  these  is  the  (necessarily  Hermitean)  position  operator  (p^  for  the  variable. 
The  action  of  pj  upon  position  vectors  provides  a  formalism  for  position  “measurements”  via 
the  eigenvalue  equation 

^3  h) = h)  ■ 

(Since  we  are  not  concerned  with  matrix  representations  for  ipj ,  there  is  no  confusion  concerning 
the  meanings  of  (pj  and  (pj).  The  expected  position  for  the  variable  (yj'j  is  defined  as 

{^j)  =  (V'jI  'fj  {i’j)  • 

The  next  lemma  will  be  of  value  to  our  discussion  of  the  properties  of  the  truth  distribution 
function. 

Lemma  41  (The  Moments  Lemma)  For  each  positive  integer  n, 

{(pf)  =  (V’jI  \^j)  =  |(^i| 

Proof: 

=  j  j  {^j\  ^'j)  ki)  M 
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=  //¥’"  (V’j  I  <5  [v'j  -  Vj)  {‘pj I  ^j)  dp'jd^pj 


Q.E.D. 

The  second  operator  of  interest  is  the  •positional  uncertainty  operator  Atpj  for  the  vari¬ 
able’s  truth  state  and  it  is  defined  as 


Aifj  =  <pj  -  (<pj)  . 

As  the  next  theorem  shows,  this  operator  is  useful  for  providing  a  measure  of  the  dispersion 
that  is  associated  with  a  variable’s  truth  distribution  function. 


Theorem  42  If 
then 


and  w 


I  is  a  normalized  truth  distribution  function  •with  v  as  its  mean, 

{vi)  =  ■' 

its  standard  deihation. 


Proof:  If  is  a  normalized  truth  distribution  function  with  mean  v,  then  (^0^^  = 

1/  follows  directly  from  the  Moments  Lemma  when  n  =  1.  Also, 

=  {'>pj\h  {pj  -  {vj)f  h\^j) 


J  j  (^i|  ^’j)  (v’il  {^3  -  Vi)  ^j)dp'jdPj 
(  ^ 

[  J 


=  J I 


i)  {Pi 


■  + 


V  (^i)  ) 

(  p]-  \ 


{fP3\'^3)dp)d<p^ 
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=  J  ^‘Pr 

Therefore,  {^PjY  \^j)  ^he  variance  and  y  (Av9j^^  is  the  standard  deviation. 

Q.E.D. 

The  properties  of  a  logical  variable  in  a  probabilistic  inference  system  can  be  more  completely 
specified  by  combining  the  probabilistic  information  resident  in  its  truth  state  vector  with 
the  associated  position  probability  distribution  information  contained  in  a  position  amplitude 
vector.  This  combination  can  be  achieved  for  each  variable  in  one  of  two  ways  to  create  either 
a  TT  state  or  an  e  state.  We  again  emphasize  that  such  states  are  of  particular  value  for  physical 
implementations  of  Dirac  networks. 

3.2  TT  STATES 

The  TT  state  for  the  logical  variable  is  the  tensor  product  state  \9j)  The  tt  state  |^>) 
for  an  n- variable  inference  system  is  the  tensor  product 

Lemma  43  If  each  position  amplitude  in  an  state  is  normalized,  then  the  tt  state  is  normalized. 
Proof:  Since  each  position  amplitude  and  truth  state  is  normalized,  then 

($1  $)  =  (0i|  0i)  (^,1  .  (0„|  0„)  (^„|  =  1  •  1  •  •  •  1  •  1  =  1- 


Q.E.D. 

3.3  THE  MOMENTUM  OPERATOR  FOR  A  LOGICAL  VARIABLE  AND  THE 
RIGID  TRANSLATION  OF  TRUTH  DISTRIBUTION  FUNCTIONS 

Complications  arise  when  a  variable  is  influenced  by  other  variables  or  has  evolved  in  time. 
In  this  case,  the  rotation  and/or  time  evolution  of  the  variable’s  prior  truth  state  induced  by 
the  action  of  the  associated  rotation  and/or  time  evolution  operators  must  be  accompanied  by 
a  consistent  change  in  the  position  amplitude  vector.  The  purpose  of  this  section  is  to  introduce 
a  translation  operator  whose  action  will  produce  such  a  change  in  a  position  amplitude  vector. 

To  formally  define  this  operator,  it  is  first  necessary  to  assume  that  a  (“fictitious”  and 
necessarily)  Hermitean  momentum  operator  pj  can  be  assigned  to  the  variable  in  an  inference 
system  and  that  it  obeys  the  commutation  axioms  given  by: 

\Pj,Pk]  =  0 
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and 

^j,Pk\  =iSjk- 

Whenever  the  last  axiom  applies,  it  is  a  simple  task  to  evaluate  commutators  of  the  form 
^^j,F  (pj)  ,  where  F  (pj)  is  any  function  of  the  operator  pj.  Such  an  evaluation  will  be  needed 
for  determining  the  properties  of  the  translation  operator  and  can  be  readily  obtained  using  the 
following  theorem  [2] (since  this  theorem  is  well-established,  it  is  stated  here  without  proof); 

Theorem  44 

The  change  of  the  position  amplitude  vector  1^^)  in  position  space  for  the  logical  variable 
that  is  consistent  with  the  total  cumulative  rotation  and  time  evolution  of  its  truth  state 
vector  \6k)  induced  by  the  action  of  some  dc[&\  is  achieved  by  the  action  upon  IV’fc)  of  the 
exponential  translation  operator  defined  by 

where  fik  is  the  distance  in  position  space  that  corresponds  to  the  cumulative  angular  quantity 
The  next  theorem  defines  the  action  of  this  operator  upon  a  position  amplitude  vector  in 
position  space. 

Theorem  45  {ipk\  Sk  {0k)  iV'fc)  =  i‘fik  "  0k\  ’^k)  ■ 

Proof:  Since  the  translation  operator  is  an  exponential  function  of  pk,  the  previous  theorem 
may  be  applied  to  yield 

[iPk,Sk{0k)]  =  (PkSk{0k)-Sk{0k)(Pk  =  i^^^ 

=  K-'i0k)Sk{0k) 

=  0kSk{0k) 

so  that 

(p„Sk{0k)  =  Sk{0k)  {(pkF0k}- 

Then 

VkSk  {0k)  Iv’fc)  —  {0k)  {Vk  +  0k}  Wk) 

=  Sk{0k){Vk  +  0k}\^k) 

=  Wk  +  ^fc}  Sk  {0k)  I'Pk)  ■ 

This  implies  that 

Sk{0k)  \Fk)  =  \^k  +  0k) 

or  equivalently 

{‘Pk\si{0k)  =  {¥>k+0k\- 
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Since  §1  {P/.)  =  =  e‘^'=P'‘,  then  5^  (— /5fc)  =  Sk  (Pk)-  Using  this  in  the  last  equation  gives 

{v>k\Sl{-Pk)-={v>k\SkiPk)  =  {^k-fik\- 

Forming  the  scalar  product  with  \-ipiP  gives  the  desired  result: 

{<Pk\Sk  iPk)  l^fc)  =  {‘Pk  -  Pk\  i’k)  ■ 


Q.E.D. 

Thus,  the  projection  of  Sk  (Pk)  IV’t)  its  position  space  produces  a  wave  function  {ipk  —  Pk\  V’fc) 
which  is  the  rigid  translation  of  {ipk\  V’fc)  (along  the  ipk  axis  in  the  graph  of  {ipk\  '’Pk))  Pk- 
our  development,  the  specification  of  a  tt  state  requires  that  for  every  variable  k  which  experi¬ 
ences  a  rotation  and/or  a  time  evolution  there  is  a  single  translation  operator  Sk{Pk)  that  is 
applied  to  its  probability  amplitude  vector  \ipk)  which  rigidly  translates  it  along  the  ipk  axis  a 
distance  Pk  that  corresponds  to  the  total  angular  rotation  and/or  time  evolution  ^k  experienced 
by  the  associated  prior  truth  state  |0fc).  Such  a  specification  is  said  to  be  consistent  and  the 
associated  string  of  rotation,  time  evolution,  and  translation  operators  is  a  consistent  string. 
Clearly,  the  magnitude  of  Pk  remains  constrained  by  the  door-stop  rule  requirement  for  ^k- 

Consider  now  the  unitarity  of  the  translation  operator  and  the  effect  of  its  action  upon  the 
normalization  of  1^^). 

Lemma  46  Sk  (Pk)  is  unitary. 

Proof:  Because  pk  is  Hermitean,  we  can  write 

§1  (Pk)  Sk  iPk)  = 

Since  \pk-,'Pk\  =  0,  then  Glauber’s  theorem  applies  so  that 

si  {Pk)  Sk  {Pk)  =  =  ifc. 

Similarly  for  Sk  {Pk)  Sl  {Pk)-  Q-E.D. 

Lemma  47  If  \ipk)  is  normalized,  then  so  is  Sk  {Pk)  \'’Pk)  ■ 

Proof:  This  is  a  consequence  of  the  unitarity  of  the  translation  operator: 

1  =  (^fcl  Pk)  =  (^fcl  ifc  \Pk)  =  {Pk\  Sk  {Pk)  Sk  {Pk)  \Pk)  > 

or  equivalently 

1  =  {ir,\Sl{Pk)ikSk{Pk)\Pk) 

=  j  {Pk\  si  {Pk)  Wk)  (v’fel  Sk  {Pk)  \Pk)  dPk 

=  I  {yrk\ Sk  {Pk)  \Pk)*  i’Pkl  Sk  {Pk)  \Pk)  d’Pk 
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=  j \{Vk\Sk{l3k)\i>k)\^ 

=  j \{v>k  -  Pk\  ■<Pk)\‘^  d<pk. 

Q.E.D. 

Thus,  the  normalization  of  a  position  amplitude  vector  is  preserved  under  the  action  of 
the  translation  operator.  The  following  theorem  describes  the  properties  of  truth  distribution 
functions  when  such  translations  are  involved. 

Theorem  48  If  \{^k\  ^fc)l^  normalized  truth  distribution  function  with  mean  v  and  stan¬ 
dard  deviation  Sk,  then  Sk  {Pk)  \‘4’k)\  ^  rigid  translation  of\{ipk\  with  mean  v  —  ^k 

and  standard  deviation  Sk- 

Proof:  Since  translation  operators  perform  rigid  translations,  then  (yjfel  (/?fc)  \ipk)  is 

a  rigid  translation  (along  the  axis)  of  \{<Pk\‘^k)\^  therefore,  has  the  same  standard 
deviation  as  \  {^Pk\  V’fc)P-  Since  the  translation  is  along  the  axis,  its  mean  is  shifted  from  that 
of  the  untranslated  truth  distribution  function  by  the  amount  /?fc-  Q.E.D. 

Before  closing  this  section,  it  is  important  to  note  that  if 

iMfc)  =  Sk  (/3fc)  IV'fc) , 

then 

(Aifcl  <Pk  W)  =  (V’fcl  (/?fc)  ^Sk  iPk) 

is  the  expected  position  of  the  translated  position  amplitude  |/i*.).  Therefore,  in  general,  {^f.) 
can  represent  the  expected  position  for  either  an  independent  (untranslated)  variable  or  a 
translated  variable.  It  will  be  clear  from  the  context  in  which  this  notation  is  used  whether  one 
or  the  other  or  both  possibilities  apply. 

3.4  THE  TT  STATE  FOR  A  GENERAL  INFERENCE  SYSTEM 

Even  though  the  order  of  application  of  rotation  and  time  evolution  operators  is  important 
for  the  specification  of  truth  states,  this  is  not  the  case  for  translation  operators.  This  follows 
from  the  fact  that  we  employ  only  one  translation  operator  for  each  variable  that  is  acted  upon 
by  rotation  and/or  time  evolution  operators  and  each  such  translation  operator  is  applied  to 
the  associated  position  amplitude  vector  only  after  all  rotation  and  time  evolution  operators 
have  been  appropriately  applied  to  their  associated  truth  state.  Consequently,  the  following 
commutation  relations  hold  for  translation  operators: 

Lemma  49 

[5,(/3,),5fc(/?fc)]=0, 

and  similarly  for  the  associated  adjoint  operators. 


3-7 


NSWCDD/TR-01/103 


Proof:  This  follows  directly  from  the  fact  that  the  momentum  operator  is  Hermitean  and 
that  \pj,pi]  =  0.  Q.E.D. 

Also  note  that  since  the  influence  and  momentum  operators  are  deflned  for  distinct  vector 
spaces,  then  [aj,pk]  =  0  so  that  all  commutators  between  rotation  and  translation  operators  and 
their  adjoints,  as  well  as  between  time  evolution  and  translation  operators  and  their  adjoints, 
vanish.  Thus,  consistent  strings  can  always  be  expressed  as  a  catenation  of  a  string  of 
its  rotation  and/or  time  evolution  operators  ^  with  that  of  its  translation  operators  A  The 
ordering  operator  &„[]  can  be  applied  to  such  a  consistent  string  with  the  following  useful  result 
(the  order  of  application  of  translation  operators  is  unimportant  since  they  are  applied  after 

dAO])-- 

da  [^]  =  da  [^]  S'. 

The  TT  state  vector  |$)  for  an  inference  system  with  both  rotational  influences,  time  evolutions, 
and  associated  rigid  translations  can  be  written  in  the  general  form 

1$)  =  a[^]  10)  =  d[S]  I©)  , 

where  |©)  and  |$)  are  tensor  products  of  all  independent  and  prior  truth  states  and  position 
amplitudes,  respectively.  We  shall  refer  to  such  a  state  as  a  general  tt  state  and  the  system 
it  represents  as  a  general  tt  inference  system.  Observe  that  the  last  expression  may  also  be 
written  as 

l$)  =  iE)a[^]|T)|A)^|Q), 

where  |E)  and  |A)  are  the  tensor  products  of  the  truth  states  and  of  the  position  amplitudes 
for  aU  the  independent  variables,  respectively,  and  |T)  and  |Q)  are  the  tensor  products  of  the 
prior  truth  states  and  position  amphtudes  for  all  the  variables  effected  by  rotation  and/or  time 
evolution  operators.  A  normahzation  guarantee  for  general  tt  states  is  provided  by  the  next 
theorem. 

Theorem  50  If  each  position  amplitude  vector  in  a  general  tt  state  is  normalized,  then  that 
general  it  state  is  normalized. 

Proof:  Let  S=  Sk  (Pk)  ■  ■  ■  Sm  {Pm)-  Since  truth  states  are  always  normalized  and  from  the 
premise 

{A|  A)  =  (fi|  fi)  =  1, 

then  the  unitary  properties  of  and  the  translation  operators  yields 

($i$)  =  (E|(T|(a[^])^a[^]|T)|E)- 

{A|  {Q\ Si  iPJ  ■■■si  iPk)  Sk  iPk)  ■■■Sm  {Pm)  \^)  |A) 

=  {E|(T|i|T)|E)(A|(fi|U---ifc|fi)lA) 

=  (E|(T|T)|E)(A|(n|Q)lA) 

=  (H|H)(A|A) 

=  1. 
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Q.E.D. 

3.5  TT  STATE  AUTOCORRELATION  FUNCTIONS 
Consider  the  general  tt  states  given  by 

|$,)  =  |E)4[^]|T)  \A)m 

and 

|$;)  =  |E}4[^']|T)lA).$/|fi), 

where  |E),  |T),  [A),  and  \Q.)  are  as  defined  in  the  last  section.  The  associated  tt  state  autocor¬ 
relation  function  is 

=  £-V. 


Thus,  a  TT  state  autocorrelation  function  factors  into  the  product  of  a  truth  state  autocorrelation 
function  £  and  a  position  amplitude  autocorrelation  function  V. 

Let  us  now  evaluate  each  of  these  factors.  If 

dM\r)  =  d,m  /HM 


and 


then 


£  = 

where  /(^  and  are  /  evaluated  for  the  post-state  angles  obtained  after  the  applications  of 
5o.[^]  and  dv[&],  respectively. 

To  determine  V  assume  that 


and 

Then 


^\Q)  =  5i(/3i)  IV-i)  S2W2)  \i’2)  •  ■  •  SniPn)  \^n) 

|f2)  =  S’M  ItAi)  1^2)  ■  •  •  S'M  IV'J . 
V  =  T>1  ■  T>2  -  ■  ■  Vn, 
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where 

=  /  I  )  {^3  I  ^3  {^i  )  I 

=  J  (^ipj  -  /?'|  3pj)  {ipj  -  /3j|  d^Pj,!  <j<n 

is  the  position  amplitude  autocorrelation  function  for  the  variable.  Thus,  T>  is  the  product 
of  position  amplitude  autocorrelation  functions  for  all  the  logical  variables  defined  by  \Q)  (or 
|T)  since  in  either  case  the  logical  variables  are  the  same). 


3.6  POSITION  MEASUREMENTS  FOR  GENERAL  tt  INFERENCE  SYSTEMS 

As  with  truth  operators,  the  position  operators  for  the  position  amplitudes  associated  with 
the  logical  variables  of  a  general  tt  inference  system  can  be  used  to  define  generalized  position 
and  uncertainty  operators  for  any  subset  of  system  variables.  If  N  is  the  set  of  variable  indices 
for  an  inference  system,  then  let  0  R  C  AT  define  a  subsystem  with  truth  state  vector  space 
dimension  2l®l.  The  generalized  position  operator  Pb  for  the  inference  subsystem  with  variables 

The  generalized  positional  uncertainty  operator  for  the  inference  subsystem  defined  by  B  is 

(3-2) 

The  expected  position  and  an  associated  expected  value  for  the  variance  for  any  subsystem  of 
a  general  tt  inference  system  can  be  evaluated  using  the  next  theorem. 

Theorem  51  If  |$)  is  the  general  it  state  for  a  general  tt  inference  system  with  a  subsystem 
defined  by  the  variable  index  set  B,  then 

I  I  j'eB 

/  1  \  ,2 
($1  Ub  1$)  =  j  E  (^i|  kj)  • 

Proof:  Let  N  index  the  system  variables  and  let  |$)  =  |©)  |/3)  |A),  where  |0)  is  the  tensor 
product  of  all  truth  state  vectors, 

\P)  =  ©iew-B  Ix/) 
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and 

lA)  =  ©fees  iMfc)  ■ 

Here  ©  denotes  tensor  products  and  |x/)  and  represent  position  amplitudes  for  both  in¬ 
dependent  variables  (e.g.,  |Xr)  =  llAr))  and  translated  amplitudes  (e.g.,  |Xr)  =  (/^r)  IV’r))- 

Then 


($|Pb|$)  =  (0|(/31(A|PB|A)|;3)|e) 

=  (010)(/3|^}(A!Pb|A) 

=  (a|Pb|A) 

=  (®fc€B  ifJ'kl)  j  (©fees  iMfc)) 

=  rli  E  h)  (nfceB-{i}  (Mfcl  ^^k))  } 


i€B 


'  '  j€B 


The  result  for  ($|  Ug  |$)  follows  from  Theorem  48.  Q.E.D. 


Therefore,  when  the  position  amplitudes  for  a  general  tt  inference  system  are  normalized,  it 
is  a  straightforward  matter  to  evaluate  the  average  mean  position  and  associated  uncertainty 
for  a  subsystem  in  terms  of  the  known  (translated  and  untranslated)  means  and  uncertainties 
associated  with  each  subsystem  variable.  Also,  it  is  important  to  notice  from  this  theorem 
that  ;  (i)  the  values  for  ($|  Pb  1$)  and  ($[  Ub  |$)  do  not  depend  upon  the  truth  states;  (ii) 

if  represents  the  probability  distribution  for  the  truth  value  of  variable  j  with 

—  [fj),  then  ($|Pb|$)  =  (^b);  and  (iii)  {^\Ub  I^)  provides  a  measure  of  the  truth 
uncertainty.  Consequently,  it  is  possible  to  formally  represent  a  Dirac  network  strictly  in  terms 
of  position  amplitude  vectors. 


3.7  £  STATES 

3.7.1  The  £  StateFor  A  Logical  Variable 

An  £  state  for  the  logical  variable  is  the  entangled  state 

\^j-,  V’f )  =  cos  dj  |-H)j  )  +  sin  9j  1-)^-  ,  (3-3) 

where  the  are  normalized  -  but  not  necessarily  orthogonal  -  position  probability  amplitudes 
such  that  l  dipj  are  the  probabilities  for  finding  the  distinguished  positions  (pf  in  the 


3-11 


NSWCDD/TR-01/103 


interval  d(fj,  respectively.  These  distinguished  positions  are  associated  with  properties  of  the 
±1  truth  values  for  the  variable  and  the  functions  are  probability  distributions 

for  these  properties  with  variances 

=  (3-4) 

and  mean  values  ,  ,  .  > 

=  (3-5) 

Thus,  the  probability  distribution  function  for  the  e  state  of  Equation(3-3)  is  the  weighted  sum 

of  these  distributions  given  by 

|(^j|  =  (3-6) 

and  may  be  interpreted  as  the  probability  distribution  for  finding  the  associated  system  in  a 
state  described  by  the  distinguished  position  <p'j  or  the  distinguished  position  tpj  when  the 
events  associated  with  the  distributions  are  independent. 


3.7.2  The  g  State  For  A  General  Inference  System 

Using  previously  developed  notation  (see  Ekjuations  (2-1),  (2-2),  and  (2-3)),  the  general  e 
state  l^')  for  a  general  (n- variable)  s  inference  system  is  the  entangled  state 

(3-7) 

=  d.m  E  /(‘^)HI^-) 

=  E  /o-(‘^)  1^^) 

where  is  /(ui)  evaluated  according  to  Equation(2-3)  using  the  post-state  angles  induced 

by  da  f^l  and 

•  ■  ■  1C") 

is  the  amplitude  associated  with  the  probability  of  finding  the  distinguished  positions  in 
d(Pi,(P2^  in  dp2,---,  and  in  dp^-  Observe  that  only  strings  ^  of  rotation  and/or  time 
evolution  operators  appear  in  this  definition.  Translation  operators  are  not  used  in  order  that 
the  amplitudes  remain  fixed  in  their  position  spaces.  The  probability  distribution  function 
for  Equation(3-7)  is  the  weighted  sum 

\{p^\  =  E  C)l^  > 


3-12 


NSWCDD/TR-01/103 


where 

KvpJ  V-Jl'  =  Kv^il  V'r>H(v’2l  >i'  •  •  •  \{Vn\  C")l' 

is  the  probability  distribution  function  associated  with  finding  the  distinguished  positions  as 
described  above. 

Theorem  52  A  general  e  state  is  normalized. 

Proof: 

ujGL^ 

oj'^Ln  ujGLn 

=  E  [/(^)]^  i^w) 

wSLjv 

=  E  [/(")P 

w€l/iv 

=  1. 

Q,E.D. 

3.7.3  £  State  Autocorrelation  Functions 
Consider  the  general  e  states  given  by 

\^<t)  =  4[^]  E  /H 

u;€l/jv 

and 

K)  =  a.[^']  E  5HI‘^)iu- 

wGLjv 

The  associated  e  state  autocorrelation  function  is 

uj'GLiv  wGL^v 

=  E 

=  E  /  (v’wl 
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where 

j  =  j  j  {^2\i2)*  {V2\i^2)d^2 

■  J  {‘Pn\  ^n)*  {Vn\  ^n)  d^n- 

3.7.4  Post-selected  g  States  And  Conditional  Probability  Distributions 

A  post-selected  e  state  is  one  which  has  been  projected  onto  one  of  its  (possible)  truth 
state  vectors.  Specifically,  the  post-selected  e  state  for  the  state  given  by  Equation  (3-3) 
is  defined  as  the  scalar  product 

S  (9;|  «,;</■*)  (3-8) 

=  [cos^J,  (-f-l  +sin^,,  {-|j  1^,;!/’*) 

=  cos cos Oj  -t-  sin  9'j  sin  9 

which  can  be  interpreted  as  the  conditional  probability  amplitude  for  finding  variable  j  in  truth 
state  if  it  is  in  state  \9j\ipf'^.  Projecting  this  state  into  its  position  space  yields 

I  9'^  :  9 =  cos 9'j  cos 9j  ((pj \i’'j)  +  sin 9'j  sin 9j  (pj \'>Pj), 
so  that  the  associated  conditional  probability  distribution  function  is  given  by 

=  cos^ (9^ cos^ +  (3-9) 

sin^  9'j  sin^  9j  |  (^ipj  |  )  |  d" 

2  cos  9'j  cos  9j  sin  9'j  sin  9j  Re  {^Pj  |  ip'j  ^  {pj  |  V'7  )  • 

This  may  be  interpreted  to  be  the  probability  distribution  function  for  finding  (or  measuring) 
a  state’s  position  values  to  be  Pj  or  p~  conditioned  upon  the  fact  that  the  truth  state  is 

when  it  is  in  state  \^9j-,xpfy  From  this  it  is  readily  seen  that  in  addition  to  being  a  weighted 

sum  of  the  distributions-  unlike  the  probability  distribution  of  Equation  (3-6)  -  the 

conditional  probability  distribution  for  a  variable’s  post  selected  e  state  includes  an  interference 
term  (the  third  term,  provided  9'j,  9j^Q,^).  This  term  can  significantly  modify  the  associated 

distribution  from  that  of  a  simple  weighted  sum  of  the  ■tpfy  distributions.  Also,  notice 

that  this  interference  term  can  be  written  equivalently  as  Atj  •  At'-  Re  (pj |  'j  (^Pj  | 

This  clearly  relates  this  interference  to  the  truth  uncertainties  At  j  and  Ar'-  that  are  associated 
with  \9j)  and  respectively. 
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The  generalized  \(f>)  =  1^)  post-selected  e  state  for  the  state  |5'o-)  given  by 

Equation(3-7)  is 


(3-10) 


Y  /HMIV’a;) 

oi'SLjv 

uj£Ln' 

ui€Lf{ 


where  /a  (a;)  is  as  defined  above.  By  extension,  this  is  the  general  conditional  probability 
amplitude  for  finding  the  inference  system  in  state  |(^)  when  it  is  in  state  I’^cr).  The  associated 
conditional  probability  distribution  function  is 


2 


tjj£Lpf 


Since  this  conditional  probability  distribution  function  is  the  square  modulus  of  a  sum,  it 
exhibits  multiple  interference  terms. 

Thus  -  in  a  sense  -  post-selection  “erases”  specific  truth  state  information  from  an  s  state. 
The  result  of  such  a  loss  of  specific  truth  state  information  is  to  produce  an  interference  “pattern” 
in  the  associated  distribution  function  which  -  as  shown  above  -  reflects  the  uncertainty  in  the 
truth  values.  This  is  in  stark  contrast  to  a  non-post-selected  (i.e.  a  non-erased)  e  state  which 
contains  specific  truth  state  information:  its  probability  distribution  is  strictly  a  weighted  sum 
which  exhibits  no  such  interference  pattern. 


3.7.5  Position  Measurements  For  General  e  Inference  Systems 

Let  Pb  and  Ub  be  the  previously  defined  (see  Equations  (3-1)  -  (3-2))  generalized  position 
and  generalized  positional  uncertainty  operators,  respectively.  The  expected  values  for  these 
operators  for  a  general  e  inference  system  can  be  evaluated  using  the  following  theorems. 

Theorem  53  If  is  the  general  e  state  for  a  general  e  inference  system  vhth  a  subsystem 
defined  by  the  variable  index  set  B,  then 

(^'^l  Pb  =  j;|i  cos^  9j  +  ifj  sin^  0^] 

and 

('^£.1  Ub  |«'^)  =  Z!  +  Y  ’ 

where  the  Oj ’s  are  post-state  truth  angles. 
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Proof:  Let  N  =  {1, 2,  ■  •  •  ,n}  index  the  system  variables  and 

where  the  0j’s  are  the  post-state  angles  induced  by  [^]-  Then 


(4',|Po|4',)  = 


i&B 


n^eAT 


jeB 
I  I  jes 

=  ^  E  H  ^  I  ^  I]  ■ 

'  '  j€B 


[cos  Oj  l-r)^.  I^t )  -h  sin  Bj  1-)^. 

=  E  I  {^j  I 


J€B 


=  ^  E  [P't  ^i\  ’ 


j^B 


and  similarly  for  the  expected  variance.  Q.E.D. 


Theorem  54  If  |d> :  4'<r)  is  a  post- selected  e  state  for  a  general  e  inference  system  with  variable 
index  set  N  and  a  subsystem  defined  by  the  variable  index  set  B  Q  N,  then 

{(f)  ■.^^\PB\<f>-  ^a)  =  \^j  ■  ^f)  ■ 

I  '  j€B 

TLkeN-{j}  {4>k  ■  ^k-,iPk  ■  ^fciV’fc  }] 

and 

{(f> :  Ub  \<f> :  Vfa)  =  E  \<i>j  :  Oji'ipf)  ■ 

'  '  j^B 

Uk^N-ij}  {^k  ■  9k-At  |<^fe  :  ^k\'^t% 

where  the  9j ’s  and  9^  ’s  are  post-state  truth  angles, 

{(fk  ■.9k\il)t\4>k-  'Pt)  =  <t>k  cos^  ^k  +  sin^  (i>k  sin^  9k  + 
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2  cos  (j)i^  cos  6k  sin  (j)k  sin  6k  Re  |  ) 

=  cos^  4>k  cos^  6k  +  sin^  (j)k  sin^  6k  + 

and 

(^(f)j  :  6j;  ipf  I  Xj  ^(t)j  :  6 =  xj  cos^  (j)j  cos^  6j  +  xj  sin^  (j)j  sin^  6j  + 

2  cos  <f>j  cos  6j  sin  (j)j  sin  6j  Re  (^tp^  |  Xj  1-07^ 

=  x+  cos^  (j)j  cos^  6j  +  xJ  sin^  (pj  sin^  6j  + 

(i)AT*j.AT,,Re(^+|Xi|^7) 

when  Xj  =  (pj  and  xf  =  pf  or  Xj  =  and  xf  =  and  where  At^,.  is  the  truth 

uncertainty  for  ^(pj^ 

Proof:  Consider 

Pb  1<^  :  \^j  ■  nfc€N-0-}  \4>k  ■  ^fc;  : 

'  ’  i€B 

where  the  6j's  and  6k  s  are  post-state  truth  angles.  Then 

{<p-.^„\pB\<p-^cr)  =  [n^€W  {<Pe  ■  Of,  0^  |]  • 

5Z  ^3  \^3  '■  )  nfceJV-0}  \<Pk  ■  Ok-,  0fc  ) 

_j€B 

=  T^l  ^3  \‘t>3  ■  ^3-^Pf)  • 

I  '  JGB 

nfc€N-{i}  (</>fc  :  Okitpkl  <Pk  ■  Ok-,'<Pk)l 

and  similarly  {oxUb-  The  desired  expressions  for  {(pj  :  6j-,3pf\Xj  \(pj  :  Oj-,ipf)  ,  Xj  G  [(pj,Apj'\, 

and  {(pk  :  ^fc;0fc  |  (Pk  '■  Ok-,P'k')  readily  obtained  using  Equations(3-8),  (3-5),  and  (3-4)  and 
theorem  4.  Q.E.D. 
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4  REPRESENTATIVE  EXAMPLE  APPLICATIONS 


The  purpose  of  this  section  is  to  provide  the  reader  with  examples  of  how  this  Dirac  algebra 
based  method  can  be  applied  to  assist  in  the  solution  of  Bayesian  inference,  systems  analysis, 
and  data  analysis  problems. 

4.1  BAYESIAN  INFERENCING 

In  this  first  section  we  contrast  a  Dirac  algebra  based  approach  to  Bayesian  inferencing 
with  contemporary  methods.  The  Dirac  truth  state  theory  developed  above  will  be  applied 
to  several  representative  problems  in  order  to  illustrate  how  the  methodology  can  be  used  in 
this  problem  domain.  These  problems  are:  (1)  a  comparison  of  results  obtained  using  the 
Dirac  method  with  those  obtained  from  the  Situational  Influence  Assessment  Model  (SIAM) 
system  for  two  very  simple  influence  networks;  (2)  applying  the  Dirac  method  to  determine  the 
probability  of  successful  engagement  for  a  cruise  missile  seeking  a  mobile  target;  and  (3)  using 
the  Dirac  method  to  solve  a  “textbook”  genetic  disease  problem.  The  Dirac  based  results  were 
obtained  using  a  prototype  software  system  DIRACNET  (see  Appendix  A)  which  implements 
the  Dirac  truth  state  theory  developed  above  (the  Dirac  theory  for  probability  distributions 
developed  above  has  not  been  implemented  in  this  software).  In  what  follows,  Dirac  algebra 
based  inference  systems  will  be  referred  to  as  Dirac  networks  or  -  for  short  -as  Dirac  nets. 

4.1.1  An  Overview  Of  Automated  Inferencing  And  Bayesian  Networks 

Since  the  1960’s,  a  variety  of  approaches  to  automated  inferencing  have  been  developed  (e.g., 
decision  trees,  decision  networks,  rule-based  systems,  causal  networks,  and  belief/Bayesian  net¬ 
works).  Extensive  discussion  of  the  problems  involved  and  approaches  to  their  solution  are 
available  elsewhere  [7,  8,  10].  These  approaches  are  typically  distinguishable  by  the  form  of 
logic/inference  they  employ  (two-valued  or  multi-valued  logics;  deductive,  inductive,  or  ab- 
ductive  inference;  and  temporal  reasoning).  The  various  problem  domains  to  which  these 
approaches  have  been  applied  can  generally  be  classified  as:  categorical  or  probabihstic  prob¬ 
lems;  explicit  decision  mechanisms  or  implicit  decision  making  problems;  and  symbolic  logic 
or  numerical  representation  problems.  Although  not  addressed  here,  experience  has  shown 
that  certain  of  these  approaches  are  better  than  others  for  addressing  specific  problem  domain 
applications. 

Early  rule-based  systems  generally  employed  straightforward  deterministic  logic  as  their 
methodological  basis.  As  the  level  of  sophistication  and  experience  increased,  non-deterministic 
systems  were  developed  which  relied  upon  probabihstic  and/or  weighting  schemes  for  their  op¬ 
eration.  These  weighting  schemes  are  either  extensions  of  probabilistic  methods  or  behave  as 
probability  measures  when  certain  restrictions  are  imposed  upon  them.  Consequently,  those 
systems  which  are  developed  as  belief  -  or  causal  -  networks  tend  to  be  predominantly  struc¬ 
tured  around  a  probability  calculus  as  the  foundation  of  their  operation.  Decision  problems 
also  employ  probabihstic  techniques  and  are  often  modeled  as  networks  in  which  the  topology 
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attempts  to  represent  those  significant  factors  and  influences  that  are  associated  with  the  de¬ 
cision  problem  of  interest.  Decision  trees  are  an  example  of  this.  Such  trees  are  designed  so 
that  the  leaf  nodes  of  the  tree  topology  serve  as  the  final  decision  variables  for  the  associated 
decision  problem. 

As  mentioned  above,  such  systems  are  also  classified  by  the  form  of  inference  they  employ 
for  their  operation.  Let  H  be  an  hypothesis  and  e  be  the  associated  evidence.  Using  this 
notation,  the  three  types  of  inference  can  be  easily  defined  as  follows: 


•  Deductive  Inference: 


If  H,  then  £. 
H 

Therefore,  e. 


•  Inductive  Inference: 

Possibly  H  accounts  for  s. 
e  and  if,  e  and  if,  •  •  •,  £  and  if. 
Therefore,  if  if,  then  £. 


•  Abductive  Inference: 


If  if,  then  £. 


Therefore,  if. 

Here  the  hypothesis  if  may  be  true  or  false  (or  may  acquire  a  value  from  a  set  of  possible 
values)  and  the  evidence  £  can  totally,  partially,  or  not  support  the  values  for  if.  The  simplest 
inference  systems  use  only  deductive  inference,  whereas  more  complex  systems  (e.g.,  belief  nets) 
may  use  both  deductive  and  abductive  reasoning  as  their  methods  of  inference. 

Both  causal  and  belief  nets  ideally  organize  their  logical  structure  in  the  form  of  directed 
acyclic  graph  (DAG)  representations  (however,  this  may  not  always  be  the  case  or  may  not  be 
possible  for  certain  applications).  Each  node  in  the  digraph  represents  a  proposition  and  the 
arcs  connecting  them  represent  the  influences  of  one  proposition  upon  another.  The  proposition 
may  be  true  or  false  -  or  it  may  assume  one  of  a  set  of  possible  values.  The  evidence  supplied 
to  the  system  can  support  or  deny  at  some  level  the  propositions  associated  with  the  system 
nodes,  as  well  as  their  influences  upon  other  node  propositions. 

Such  systems  are  initialized  by  assigning  to  each  node  a  measure  -  typically  a  probabilistic 
measure  -  which  reflects  a  prior  belief  or  probability  associated  with  each  node  (proposition). 
A  similar  measure  which  reflects  the  strength  of  the  influence  one  node  exerts  upon  another  is 
assigned  to  each  arc  in  the  network.  Using  this,  conditional  probabilities  that  child  nodes  are 
true/false  given  that  their  parents  are  true/false  are  then  determined.  After  initialization,  the 
inferencing  process  can  be  invoked  by  instantiating  the  system,  i.e.  by  establishing  the  truth 
or  falsehood  of  one  or  more  system  propositions  (nodes) . 
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Algorithms  are  usually  employed  to  control  and  update  the  probabilities  in  the  network 
using  TT  and  A  message  passing.  The  tt  values  are  determined  from  node  and  arc  probability 
measures  and  are  passed  as  tt  messages  from  parent  nodes  to  child  nodes  in  order  to  modify 
their  probability  measures.  The  A  messages  are  used  to  pass  A  values  from  child  nodes  to 
parent  nodes.  For  either  of  these  cases,  the  system  is  ideally  designed  so  that  the  updates  at 
each  node  result  only  from  the  information  received  from  adjacent  nodes.  In  a  system  for  which 
a  DAG  defines  its  topology  (e.g.  a  directed  tree),  the  message  passing  complexity  is  usually 
quite  manageable.  However,  if  the  network  topology  which  represents  the  inference  system  is 
not  represented  by  a  DAG  -  i.e.  when  the  underlying  digraph  contains  cycles  -  then  updating 
can  become  extremely  complex  and  cumbersome  since  it  may  involve  re-evaluations  of  all  joint 
probabilities  for  the  system. 

4.1.2  Dirac  Networks  As  Models  For  Inference  Systems 

The  structure  and  operation  of  Dirac  networks  is  similar  to  that  of  Bayesian  networks.  In 
particular:  (1)  both  employ  digraphs  to  represent  their  proposition/influence  topologies;  and 
(2)  both  pass  update  information  between  nodes  along  directed  arcs  of  influence.  However, 
as  discussed  above,  Dirac  networks  use  angular  measures  for  proposition  truth  state  vectors 
(Figure  4.1-1)  instead  of  probability  measures  to  represent  the  truthfulness  of  propositions 
(nodes).  Furthermore,  the  influences  of  nodes  upon  other  nodes  are  obtained  by  modifying 
these  angular  measures  via  rotations  of  the  associated  truth  state  vectors.  These  rotations  are 
accomplished  through  the  ordered  actions  of  unitary  rotation  operators  upon  the  truth  state 
vectors  for  the  associated  child  variables  and  eliminates  the  need  for  probabilistic  propagation 
of  updates  within  the  network. 

For  a  Dirac  network  an  initial  estimate  of  the  truthfulness  for  each  node  (proposition)  in  the 
underlying  network  is  made  by  assigning  an  angle  0^  <  9  <  90®  to  define  each  of  the  associated 
truth  state  vectors.  The  effects  of  influences  between  nodes  are  contained  within  the  rotation 
angles  a  that  are  assigned  to  each  arc  in  the  network.  Unhke  Bayesian  networks  -  which  require 
a  probabihstic  form  for  propagating  influences  (and  updates),  Dirac  networks  only  utihze  simple 
(ordered)  vector  rotations  to  achieve  the  update.  In  addition,  this  rotation  permits  a  level  of 
fleodbility  that  is  not  available  in  conventional  Bayesian  networks  -  the  rotation  angle  a  can 
assume  any  functional  form  without  concern  for  violating  system  normalization  requirements. 
Either  “prior”  or  “posterior”  values  for  9  may  be  used  when  computing  0-dependent  a  angles 
via  the  proper  utilization  of  order  indexing  for  the  associated  rotation  operators  (when  posterior 
0’s  are  used,  the  influencing  nodes  are  updated  before  the  influenced  nodes).  The  use  of  order 
indexing  yields  a  procedure  that  is  analogous  to  A-  and  vr-message  passing  in  belief  networks. 
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Some  of  the  major  differences  between  Dirac  networks  and  Bayesian- type  networks  are 
summarized  in  the  following  table. 

Dirac  Networks 

oangular  values  assigned  to  nodes  define  truth  as  a  vector 
oorder  indexed  rotation  operators  propagate  influences 
opropagation  model  is  unconstrained 
oone-/bi-directional/cyclic  propagation  permitted 
(one-directional  demonstrated) 


Bayesian  Networks 

oprior/conditional  probabilities  assigned  to  nodes 
oA  and  tt  messages  propagate  influences 
oprobability  calculus  is  required  for  propagation 
oone-directional  and  bi-directional  propagation  is  possible 


|-> 


l  +  > 

Figure  4.1-1.  Truth  Vector  Space  Showing  Truth  State  For  Variable  6 

4.1.3  A  Comparison  With  SIAM 

As  a  first  case,  comparison  is  made  between  the  results  obtained  using  DIRACNET  and  the 
SIAM  system  [9]  for  the  two  very  simple  networks  specified  with  their  prior  probability  values 
in  Figure  4.1-2  (net  1)  and  Figure  4.1-3  (net  2).  SIAM  uses  prior  probability  assignments  to 
network  nodes  and  both  a  positive  influence  (labelled  g  in  Figures  4.1-2  and  4.1-3)  and  a  negative 
influence  (labeled  h  in  Figure  4.1-2  and  4.1-3)  between  nodes.  These  influence  values  are  related 
to  conditional  probabilities  and  the  positive  (negative)  influence  expresses  how  likely  it  is  that 
the  influenced  node  will  be  changed  if  the  influencing  node  is  true  (false) .  After  the  network  has 
been  processed,  the  nodes  that  are  influenced  will  have  updated  marginal  probability  values. 

The  algebraic  expression  for  the  Dirac  truth  state  for  net  1  is  (rotation  operator  application 
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order  doesn’t  matter  since  the  influence  angles  are  constant) : 

^neti  =  ^xb(29.70)Rbc(i1.7°)|540)|45°)|36^) 

=  |54^)|l5.3°)|24.3^). 

Based  upon  this,  DIRACNET  yielded  posterior  probabilities  of  0.4,  0.83,  and  0.73  for  nodes 
A,  B,  and  C,  respectively.  These  values  agreed  with  the  SIAM  results  to  within  the  limits  of 
numerical  roundoff  accuracy. 

The  algebraic  expression  for  the  Dirac  truth  state  for  net  2  is  (again,  application  order 
doesn’t  matter): 

mnet  2  =  Rad  (9")  Rbd  (9")  Rod  (9°)  |45«^)  |450  )  145°^)  |450^) 

=  |45°)|450)|45?;)|i82,). 

DIRACNET  provided  posterior  probabilities  of  0.5,  0.5,  0.5,  and  0.80  for  nodes  A,  B,  C,  and 
D,  respectively.  The  posterior  probabilities  generated  by  SIAM  are  0.5,  0.5,  0.5,  and  0.789  for 
nodes  A,  B,  C,  and  D,  respectively.  This  small  discrepancy  is  likely  due  to  integer  roundoff 
used  in  DIRACNET  calculations. 


g  =  0.6 


h  =  0.7 


p  =  0.5  p  =  0.6 

g  =  0.3 
h  =  0.4 


B 


Figure  4.1-2.  A  Simple  Probability  Network 


Figure  4.1-3.  A  Simple  Probability  Network 


4.1.4  A  Dirac  Network  For  A  Cruise  Missile  Search  And  Destroy  Mission 

This  example  shows  how  the  Dirac  method  can  be  applied  to  problems  of  military  interest. 
In  order  to  illustrate  this,  a  simplified  version  of  a  cruise  missile  search  and  destroy  mission  has 
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been  selected.  In  particular,  a  cruise  missile  is  assumed  to  be  seeking  a  mobile  target  in  varying 
terrain  and  variable  weather  conditions.  The  problem  is  to  construct  a  Dirac  network  which 
will  use  observational  evidence  to  predict  the  probability  that  the  target  will  be  successfully 
engaged  by  the  missile.  It  is  assumed  that  the  search  area  is  partitioned  into  search  regions. 
The  Dirac  network  employed  in  this  illustration  is  designed  to  use  only  one  of  many  possible 
search  patterns  available  to  a  cruise  missile  in  order  to  find  a  mobile  target  in  one  of  these 
regions,  i.e.  region  A.  In  practice,  the  search  evaluation  would  clearly  involve  all  regions  in 
the  partition  and  might  involve  determining  the  probabilities  for  engagement  for  a  variety  of 
possible  search  patterns  that  can  be  used  by  the  missile  in  order  to  find  the  target.  Slave  Dirac 
networks  for  each  region  and  search  pattern  combination  would  be  concurrently  processed  in 
order  to  supply  their  associated  engagement  probabilities  to  a  master  Dirac  network  which 
would  make  the  final  recommendations  for  allocating  missiles  to  search  areas  and  the  search 
patterns  that  would  be  best  to  use. 

The  logical  variables  and  influences  for  the  Dirac  network  in  this  example  are  shown  in  the 
schematic  of  Figure  4.1-4.  There,  integers  juxtaposed  to  each  logical  assertion  are  the  node 
labels  for  nodes  associated  with  the  assertions.  Using  this,  we  may  readily  write  the  following 
algebraic  expression  for  this  Dirac  network’s  truth  state  vector: 

l^r)  =  d[1t]  \di)  \92)  |03)  1^4)  I^S)  1^6)  1^7)  l^s)  l^s)  l^io)  l^ll)  , 

where  the  system  influences  are  imposed  by  the  composite  rotation  operator  according  to  the 
ordering  (order  matters  because  0j-dependent  rotation  angles  will  be  used) 

d[fl]  =  Rio, 11  (aio.ii) -^.lo  (0:9,10) -^8,9  (0:8,9) -R", 10  (0:7,10) -^,8  (o^e.s) 

R5,7  (0:5,7)  .^,9  (0:4,9)  R4,7  (0:4,7)  (0:3,6)  R2,8  (0:2,3)  -^2,5  (0:2,5) 

Ri,5  (0:1,5)  ■ 

Here,  6i  is  the  prior  angular  setting  for  the  variable  (node)  and  ai^j  is  the  rotation  angle  for 
the  influence  of  the  variable  upon  the  variable. 

This  is  the  general  truth  state  manipulated  and  evaluated  by  DIRACNET.  As  a  repre¬ 
sentative  test  case,  the  logical  variables  (nodes)  for  the  network  were  initialized  with  angular 
settings  according  to  the  following  table  which  also  includes  the  prior  expected  truth  values  for 
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each  variable: 


i 

(node  index) 

ei 

(degrees) 

fri) 

(prior  value) 

1 

50 

-0.174 

2 

40 

0.174 

3 

0 

1.000 

4 

30 

0.500 

5 

45 

0.000 

6 

25 

0.643 

7 

45 

0.000 

8 

45 

0.000 

9 

45 

0.000 

10 

45 

0.000 

11 

45 

0.000 

Note  that  the  node  3  angular  setting  of  0°  insures  that  the  assertion  that  “search  pattern  no. 
1  is  selected”  is  precisely  true.  Also,  the  assignment  of  6i  =  45*^  indicates  that  there  is  initially 
total  uncertainty  concerning  the  truth  of  the  associated  logical  variables. 


The  influence  angles  aij  that  were  used  are  summarized  in  the  next  table.  Included  is  a 
column  which  indicates  how  the  influence  angle  was  obtained  from  the  value  in  the  table.  All 
but  one  of  the  influence  angles  (aa^g)  that  were  applied  via  the  associated  rotation  operators 
were  obtained  by  using  a  scaled  initial  input  value  ajj  =  7j  •  where  j  is  the  initial  input 
value,  aij  is  the  angle  used  in  the  rotation,  and  7^  is  a  sigmoid  scale  factor  given  by 

Here,  /3  is  a  factor  which  controls  the  slope  of  the  transition  region  of  the  sigmoid  function  so 
that:  (1)  the  closer  6i  is  to  45*^  for  the  influencing  node  (i.e.,  the  more  uncertain  it  is),  the 
closer  the  angle  is  to  0°;  (2)  if  the  influencing  node  is  almost  certainly  false,  the  scale  factor 
will  be  close  to  —1,  thereby  changing  the  sign  of  with  little  impact  upon  its  magnitude; 
and  (3)  if  the  influencing  node  is  almost  certainly  true,  the  angle  ccij  will  not  change  in  sign  or 
appreciably  in  magnitude.  The  use  of  such  a  scaled  value  is  indicated  by  “scaled”  in  the  “form 
of  influence”  column  of  the  table.  The  remaining  angle  of  influence  is  an  unsealed  application 
of  the  input  value  so  that  ag^g  =  0:3  g-  This  is  denoted  by  “unsealed”  in  the  “form  of  influence” 
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column. 


i 

(parent  index) 

j 

(child  index) 

(degrees) 

Form  of 

Influence 

1 

5 

10 

scaled 

2 

5 

10 

scaled 

2 

8 

10 

scaled 

3 

6 

10 

unsealed 

4 

7 

20 

scaled 

4 

9 

-10 

scaled 

5 

7 

20 

scaled 

6 

8 

10 

scaled 

7 

10 

10 

scaled 

8 

9 

10 

scaled 

9 

10 

10 

scaled 

10 

11 

10 

scaled 

Note  the  sign  difference  between  7  and  a'^  g.  The  purpose  of  this  is  to  account  for  the  fact 
that  adequate  cover  increases  the  truth  of  the  assertion  that  the  “threat  is  in  region  A”  but 
decreases  the  truth  of  the  assertion  that  the  “threat  is  detectable  in  region  A.” 

Clearly,  the  order  of  rotation  operator  application  is  important  for  this  problem  since  all 
but  one  influence  angle  is  functionally  dependent  upon  its  associated  parent  truth  state.  It  is 
obvious  from  a  study  of  the  direction  of  the  arcs  connecting  the  variables  in  Figure  4.1-4  and 
the  order  specification  d[1t]  that  the  influences  are  propagating  through  the  system  starting 
from  the  variables  at  the  bottom  of  the  figure  upwards  to  the  top-most  variable  11.  The 
posterior  values  for  each  6i  obtained  after  propagating  the  influences  (via  vector  rotations)  in 
one  direction  from  the  root  nodes  through  the  network  to  node  11  are  given  in  the  following 
table.  Also  tabulated  are  the  posterior  expected  truth  values  for  each  logical  variable. 


i 

(node  index) 

e. 

(degrees) 

(posterior  value) 

1 

50 

-0.174 

2 

40 

0.174 

3 

0 

1.000 

4 

30 

0.500 

5 

45 

0.000 

6 

15 

0.866 

7 

26 

0.616 

8 

28 

0.559 

9 

45 

0.000 

10 

36 

0.309 

11 

36 

0.309 

As  can  be  calculated  from  this  table,  the  “logic-based”  certainty  for  the  system  provided  by 
the  system  truth  operator  is  (f'j  =  (^)  Ei=i  i^i)  =  0.378.  This  suggests  that  additional 
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information  about  system  variable  truth  states  and  influences  which  would  increase  the  value 
for  (r'j  would  be  useful  (this  is  always  the  case!).  Also,  as  seen  from  this  table,  (tio)  =  (r u)  = 
0.309.  This  implies  that  the  assertions  “the  threat  is  engaged”  and  “the  mission  is  successful” 
are  more  true  than  false.  If  needed,  the  probabilities  pio  and  pn  for  the  truth  of  these  assertions 
are  easily  computed  from  the  posterior  values  for  0io  and  using  the  theory  developed  above: 

Pio  =  Pii  =  cos^  36®  =  0.654. 

Thus,  based  upon  this  formulation  of  the  problem,  if  the  probability  of  success  pu  is  acceptable 
to  decision  makers,  then  it  is  reasonable  for  them  to  allocate  a  missile  to  search  region  A 
using  search  pattern  no.  1.  Of  course,  evaluations  based  upon  all  other  region/search  pattern 
combinations  might  suggest  another  better  allocation  strategy. 


Figure  4.1-4.  A  Cruise  Missile  Search  Pattern  Decision  Network 


4.1.5  A  Genetics  Application 

As  a  final  illustration  of  Bayesian  inferencing,  a  genetics  application  discussed  by  Jensen[?] 
is  modelled  and  solved  using  DIRACNET.  The  problem  may  be  simply  stated  as  follows:  Given 
that  a  horse  has  manifested  a  genetic  disease,  determine  which  ancestors  are  carriers.  The  DAG 
for  this  problem  is  shown  in  Figure  4.1-5,  where  letters  A  through  K  represent  the  assertions 
“horse  A  carries  the  gene,”  etc.,  and  the  arcs  connect  offspring  with  their  parents.  There,  J  and 
K  have  been  crossed  to  produce  a  horse  that  has  the  disease  which  can  be  carried  as  a  recessive 
gene.  Since  neither  J  nor  K  manifest  the  disease  but  have  produced  an  offspring  that  does, 
then  both  J  and  K  are  known  to  carry  the  gene  recessively.  Given  this  information,  a  Dirac 
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network  can  be  constructed  and  used  to  determine  how  likely  it  is  that  the  genetic  ancestors  of 
J  and  K  are  carriers  of  the  gene.  Note  that  although  the  causal  mechanism  is  in  the  direction 
from  ancestor  to  child,  the  implication  mechanism  -  as  shown  by  the  DAG  -  is  in  the  opposite 
direction. 

The  Dirac  truth  state  for  this  problem  is 

1^)  =  d[1^]  \6a)  \6b)  \ec)  \0d)  \0e)  I^f)  I^g)  \0h)  \9i)  \ej) , 

where  d\tt]  is  the  composite  rotation  operator  ordering  given  by 

d[fi]  =  Rie  (aiE)  Rib  (a/s)  Rhd  {oihd)  R-hc  [ochc)  Rgc  (acc)  Rgb  (o^gb) 

RfB  (apB)  RfA  (ctFA)  RkI  {otKl)  RkH  {(AKh)  RjG  {oiJg)  RjF  (o:jf)  ■ 

This  system  was  initialized  in  DIRACNET  with  prior  values 

6a  =  9b  =  9c  =  9d  =  9e  =  9f  =  6g  =  9h  =  9i  =  89° 

and 

9j  =  9e  =  0°. 

These  assignments  reflect  the  facts  that  horses  A,  B,  C,  D,  E,  F,  G,  H,  and  I  are  initially  assumed 
to  most  probably  not  carry  the  gene  and  that  horses  J  and  K  are  known  to  carry  the  gene.  All 
of  the  rotation  angles  used  to  propagate  the  influences  used  the  same  functional  form  given  by 

=  d-25°, 

where  X  and  Y  are  variable  (node)  labels  defined  by  'R,  and  6'x  is  the  posterior  angle  in  degrees 
for  variable  X  (consequently,  the  order  of  rotation  operator  application  is  relevant). 

A  single  propagation  pass  through  the  Dirac  network  in  the  direction  of  the  associated  DAG 
yielded  results  comparable  to  those  given  by  Jensen[?].  These  results  are  tabulated  below: 


X 

(node  index) 

(degrees) 

Posterior  Probability  That 

Assertion  X  is  True 

A 

63 

0.21 

B 

11 

0.96 

C 

37 

0.64 

D 

63 

0.21 

E 

63 

0.21 

F 

42 

0.55 

G 

42 

0.55 

H 

42 

0.55 

I 

42 

0.55 

J 

0 

1.00 

K 

0 

1.00 

Additional  useful  information  was  provided  by  DIRACNET  concerning  the  informational 
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uncertainty  associated  with  the  network  (recall  from  above  that  a  ratio  of  von  Neumann  en¬ 
tropies  provides  a  measure  of  informational  uncertainty).  For  this  problem,  DIRACNET 
yielded  the  following  uncertainties  for  variable  sets  S,  T,  and  U : 

As  =  0.67,  S  =  {A,  B,  C,  D,  E} 

At  =  0.99, T  =  {F,G,H,I} 

Au  =  Q.00,U  =  {J,K}. 

These  values  indicate  that  the  information  associated  with  variable  set:  (1)  U  is  completely 
certain  (recall  that  Au  =  0.00  is  total  certainty);  (2)  T  is  most  uncertain  (recall  that  At  =  1-00 
is  total  uncertainty);  and  (3)  S  is  “moderately”  certain.  Note  that  these  uncertainties  are 
consistent  with  the  6'x  angles  and  posterior  probabilities  in  the  last  table.  More  importantly, 
observe  that  item  (2)  implies  that  additional  information  concerning  the  variables  in  set  T 
would  likely  provide  a  “better”  overall  result. 


4.2  SYSTEMS  ANALYSIS 

In  this  section  we  illustrate  how  the  Dirac  algebra  based  methodology  can  be  used  to  support 
system  design  trade-off  studies.  We  will  also  develop  and  illustrate  a  perturbation  theory  which 
can  be  used  for  certain  system  sensitivity  analyses. 
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4.2.1  A  Simple  System  Design  Trade-ofF  Study 

Consider  the  following  simple  model  of  a  military  weapon  subsystem:  A  track  file  is  prepared 
for  a  set  of  targets  from  the  data  gathered  by  a  group  of  sensors.  This  track  file  is  used  to 
prepare  a  plan  for  a  collection  of  weapons  in  order  that  they  may  engage  the  targets.  Suppose 
we  wish  to  use  the  Dirac  algebraic  method  to  contrast  the  performance  of  systems  comprised 
of  two  such  subsystems:  (1)  when  the  subsystems  are  permitted  to  operate  autonomously;  and 
(2)  when  the  two  subsystems  are  centrally  coordinated.  We  will  first  apply  this  approach  to 
describe  general  truth  state  vectors  and  associated  performance  parameters  that  we  will  use  for 
the  analysis  of  these  systems.  Then  we  shall  discuss  in  closed  form  a  special  case  where  specific 
influences  are  assigned  to  the  parent-child  pairs  associated  with  each  system.  Let  us  begin  by 
examining  a  single  subsystem. 

4.2.1. 1  The  State  Vector  And  Performance  Parameters  For  A  Single  Subsystem 

We  model  a  single  subsystem  by  using  an  influence  network  consisting  of  three  variables 
which  make  assertions  about  the  track  file,  the  engagement  plan,  and  the  weapon-on-target 
engagements.  The  variable  indices  and  associated  assertions  are  given  as  follows: 

Variable  Index  Assertion 


1  The  track  file  quality  for  all  targets  is  good. 

2  A  good  engagement  plan  is  generated  and 

distributed  to  the  weapons  in  time. 

3  All  targets  are  successfully  engaged  and 

neutralized. 

The  prior  truth  states  for  these  variables  are  j^i),  |^2)  and  l^s),  where  the  subscripts  are 
the  variable  indices.  Furthermore,  assume  that  variable  1  exerts  an  qi2  influence  upon  variable 
2  and  that  variable  2  exerts  an  023  influence  upon  variable  3.  Then  the  truth  state  vector  for 
this  single  system  is 

1$)  =  d[7^]  |0i)  1^2)  |03> 

=  Rl2iai2)R23{o!23)  1^1)  1^2)  1^3) 

=  1^1)  1^2  -  ai2}  1^3  -  0:23) , 

where  d[7&]  means  that  the  specified  order  of  application  of  the  rotation  operators  is  important 
(so  that  012  and  023  are  0i-dependent  and  02-dependent  rotation  angles,  respectively).  When 
expanded  in  terms  of  truth  basis  vectors,  this  becomes 

1$)  =  cos  01  cos(02  -  012)  cos(03  -  023)  1+  -k  -!-)  -t- 
cos  01  cos(02  —  012)  sin(03  -  023)  |-t-  H — )  -b 
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cos  01  sill(02  -  C»:i2)  cos(03  -  023)  |H - r)  -r 

sin 01  cos(02  -  ai2)  cos(03  -  a23)  |-  +  +)  -r 

cos  01  sin(02  -  ai2)  sin(03  -  0:23)  [H - )  + 

sin 01  cos(02  -  ai2)  sin(03  -  0:23)  |  -  +  — )  + 

sin  01  sin(02  -  q;i2)  cos(03  -  023)  | - h)  + 

sin 01  sin(02  -  0:12)  sin(03  -  a23)  | - ) . 

We  will  use  the  post-state  expected  value  for  the  truth  operator  fs  for  variable  3,  i.e. 

{ts)  =  (^3  —  a23|  T'a  1^3  -  0^23) , 

as  a  measure  of  this  simple  subsystem’s  performance.  Thus,  the  closer  the  value  of  (f3)  is  to  1 
(— 1)  the  more  true  (false)  the  assertion  for  variable  3  and  the  better  (worse)  the  subsystem’s 
performance.  We  shall  also  use  the  projection 


prob{+  +  -r) 


=  ($1 +  +)(+  +  + 1^>) 

=  COS^  01  COS^(02  -  ai2)  COS^ (03  -  a23) 


and 


where 


and 


Ai  = 


El 

Elm,. 


El  =  —K  ^cos^  01  Incos^  01  -I-  sin^  0i  Insin^  0i^ 


Eim^  =  Kln2 


as  additional  measures  of  performance.  Clearly,  prob{+  +  -r)  is  the  probability  that  the  sub¬ 
system  is  in  the  most  desirable  truth  basis  state  |-l-  +  +)  where  all  of  the  subsystem  assertions 
are  precisely  true.  If  desired,  other  projection  operators  may  be  used  to  determine  the  proba¬ 
bility  that  the  system  is  in  other  truth  basis  states,  e.g.  it  may  be  of  interest  to  examine  the 
sensitivity  of  the  logical  negation  of  assertion  3  (i.e.,  it  is  false  that  all  targets  are  successfully 
engaged  and  neutralized)  to  changes  in  0i  by  evaluating  prob{+  +  —).  The  quantity  Ai  provides 
a  measure  of  the  uncertainty  -  i.e.  lack  of  information  -  associated  with  the  truth  state  for 
variable  1,  thereby  quantifying  the  quality  of  the  track  files.  Obviously,  the  entropy  for  any 
subset  of  variables  can  be  used  to  provide  other  uncertainty  measures.  For  example,  it  might 
be  of  interest  to  determine  how  (fs)  changes  with  variations  in  the  quality  of  the  engagement 
plan  as  quantified  by  A2  =  E2/E2^^^- 


4.2. 1.2  The  Truth  State  Vector  And  Performance  Parameters 
For  Two  Autonomous  Subsystems 


Consider  now  the  truth  state  vector  for  two  subsystems  of  the  type  described  above  which 
are  operating  autonomously.  Let  =  {1,2,3}  index  the  variables  as  defined  above  for  the  first 
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subsystem  and  let  B  =  {4, 5, 6}  index  an  identical  set  of  variables  for  the  second  subsystem. 
Also,  let  ai2  and  Q23  be  the  influences  in  the  first  subsystem  and  let  0145  and  Q56  be  the 
influences  in  the  second  subsystem.  The  general  truth  state  for  a  system  comprised  of  two  such 
autonomous  subsystems  is 

l$a)  =  Rl2(ai2)R23(tt23)  1^1)  1^2)  i^s)  ■^5(a45)-R56(a56)  1^4)  1^5)  l^e) 

=  i^Ba)  • 

Here  the  subscript  “a”  indicates  “for  the  autonomous  system.”  Thus,  the  truth  state  vector  for 
this  “uncoupled”  system  is  the  tensor  product  of  the  truth  state  vectors  for  each  subsystem. 
We  note  that  this  is  a  general  result  for  a  system  comprised  of  a  finite  number  of  autonomous 
subsystems.  In  particular, 

•  If  :  1  <  i  <  is  a  set  of  truth  state  vectors  for  n  autonomous  (non-interacting) 

subsystems  which  comprise  a  system,  then  =  iV’i)  1^2) '  ‘ '  I'^n)  truth  state 

vector  for  the  system. 

As  before,  we  will  use  the  post-state  expected  value  for  a  system  truth  operator  T  = 
i  (f3  -t-  fe)  for  variables  3  and  6  to  evaluate  the  system’s  performance: 

(^)a  ^  \  ^  • 

We  shall  also  use  prohai-r  -(-  -i-  -I-  -h-r)  and  Ai4„  as  additional  performance  measures,  where 


pro6(-f  -r  -H  +  +d-)a 


(’4'al  -f  -r  -b  +  ^+)  (4-  -f  -r  -f  +-t-  l^-a) 
{^A^+  +  ^)A.A.  (+  +  +l^/lj- 
(^Bal  +  +  (+  +  +  I^Ba) 

pro6^„(-r  4-  +)prohBA+  +  +) 


and 


Ei 

max 

EU  +  -E'4a 


2Kln2 

1  /  Eu 


Ea. 

2  V«ln2  Kin 2 
=  2  ■ 


Observe  that  if  these  two  subsystems  are  identical,  i.e.. 


then 


01  =  04,^2  =  ^5,^3  =  ^61^12  =  a45i  and  a23  =  055, 
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probA^  (-f  +  +)  =  probB^  (+  +  +)=  prob{+  + -r)  => 

probai+  -r  +  -r  ++)  =  \prob{+  +  , 

and 

Ai„  =  A4„  =  Ai  =>  Ai4^  =  Ai. 

These  properties  have  obvious  generalizations  for  systems  comprised  of  a  finite  number  of 
identical  autonomous  subsystems.  This  suggests  that  certain  aspects  of  the  behavior  of  such 
systems  may  be  ascertained  from  the  study  of  a  single  subsystem  of  the  system. 


4. 2. 1.3  The  Truth  State  Vector  And  Performance  Parameters  For  Two 
Coordinated  Subsystems 


Now  let  the  system  be  such  that  it  consists  of  a  pair  of  subsystems  as  just  described  with 
an  additional  variable  indexed  by  7  which  coordinates  the  plaiming  of  the  two  subsystems  via 
the  influences  aj^  and  aj^.  Specifically,  let  variable  7  make  the  following  assertion:  “T/ie 
engagement  plans  for  each  subsystem  are  perfectly  coordinated” 

For  the  purpose  of  illustration,  we  choose  the  rotation  operator  application  order  such  that 
the  general  truth  state  vector  for  this  system  is 

IFc)  =  Rl2{0^12)R72{0!72)R23(0i23)  i^l)  l^z)  l^s)  ’ 

=  RA5{<^i5)R75(0‘75)^6{0‘56)  1^4)  l^s)  l^s)  I^t)  > 

where  the  subscript  “c”  indicates  “for  the  coordinated  system.”  The  associated  performance 
parameters  of  interest  are 

(r)^  =  5((r3)e+(f6)c). 

probc{+  -r  +  +  +  +  -f)  =  probA^{+  +  +)probB^{+  +  +)prob7{+), 

and 

Ai4c  =  Ai4„. 

This  last  relationship  between  uncertainty  parameters  results  from  the  fact  that  decision  vari¬ 
ables  1  and  4  are  independent  in  both  the  autonomous  and  coordinated  systems.  Also,  observe 
that  if  072  =  CK75  =  0,  i.e.,  there  is  no  coordination,  then  {9j\  Fc)  =  |^a)- 


4.2. 1.4  A  Specific  Instance 

In  this  section,  we  will  examine  the  performance  of  the  two  systems  described  above  when 
the  associated  influences  are  defined  as  follows: 

012  =  —^1, 
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Q!23  =  -^2, 

<^45  =  ~^4, 

(i56  =  —^5^ 

and  „ 

TT 

0:72  =  «75  =  -  ^t)- 

Here  it  is  assumed  that  the  “coordinator”  exerts  the  same  influence  over  both  engagement  plans 
and  the  “strength”  of  the  influence  resides  in  the  magnitude  of  the  scaling  parameter  /?.  These 
influence  assignments  imply  the  following  intuitive  relationships: 

•  The  quality  of  the  engagement  plans  varies  directly  with  the  quality  of  the  track  files; 

•  The  effectiveness  of  the  engagements  varies  directly  with  the  quality  of  the  engagement 
plans;  and 

•  The  quality  of  the  engagement  plans  varies  directly  with  the  quality  of  the  coordination. 
Using  these  assignments  in  the  performance  parameters  for  the  autonomous  system  yields: 

(t'^  =  —  [cos^(0i  +  ^2  +  ^3)  ~  sin^(0i  +  ^2  +  ^3)  + 

cos^(04  +  05  +  06)  “  sin^(04  +  05  +  06)]» 

pro6a(+ +  +  + +d-)  =  cos^01  cos^(01  +  02)  cos^(01  +  02  +  03)  • 

COS^  04  COS^(04  +  05)  COS^(04  +  05  +  06), 

and 

Ai4„  =  —  (— j- — )(cos^  01  Incos^  01  +  sin^  0i  Insin^  0i  + 
cos^  04  In  cos^  04  +  sin^  04  In  sin^  04) . 

Similarly,  for  the  coordinated  system  we  obtain: 

(f)^  =  ^{COS2[01  +02  +  03  -/3(|- 07)]- 
sin^[0i  +  02  +  03  -  -  ^7)]  + 

COs2[04  +  05  +  06  -/?(f  -07)]- 
sin^[04  +  05  +  06  -  p(^  -  07)]}, 

TT 

pro&c(+ +  +  +  +  + +)  =  cos^0i  cos^[0i  +  02  — /3(— —  07)]  • 

cos^[01  +  02  +  03  -  p(^  -  07)]  • 
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COS^  04  COS^[04  -T  05  -  -  ^*7)]  • 

COS^[04  +  05  +  ^6  ~  /?(‘2  —  ^7)]  ■ 
COS^  07, 


and 

Ai4^  =  Ai4„. 


4.2. 1.5  A  Performance  Comparison  Of  The  Autonomous  And  Coordinated 
Systems  Using  The  Truth  Operator 

Here  we  characterize  the  relative  performances  of  the  autonomous  and  coordinated  systems 
using  the  ratio  of  (f)  and  (f\  .  These  values  indicate  how  true  -  on  average  -  the  assertions 
are  that  are  made  by  variables  3  and  6.  Thus,  we  have  a  relative  measure  of  how  successful  the 
weapon-on-target  engagements  are  for  each  system. 

To  better  understand  the  performance,  let  us  parameterize  the  results  in  terms  of  sums  of 
the  truth  angles  using  the  following: 


7  =  01  -I-  02  +  03 


and 

77  =  04  -f-  05  -H  00- 

Also,  for  the  coordinated  system,  let  us  assume  that  the  assertion  made  by  variable  7  is  com¬ 
pletely  true  so  that  07  =  0.  Thus,  the  engagement  plans  for  each  of  its  subsystems  are  perfect 
and  we  say  that  the  coordinator  is  perfect.  We  shall  use  /3  as  the  parameter  which  controls 
how  influential  this  perfect  coordination  effort  is  upon  each  subsystem’s  separate  engagement 
•  plan.  In  particular,  there  is  no  coordination  of  subsystem  plans  when  /3  =  0  and  complete 
coordination  when  /3  =  1.  Thus,  0  <  /?  <  1  reflects  the  level  of  coordination  between  these  two 
extremes. 

Using  this  parameterization  yields  the  following  ratio: 

(t') 

^  =  {cos2[7-/?(|)]-sin2[7-/?(f)]  + 

cos2[T/-/3(|)]-sin2[77-/3(|)]}/ 

{cos^  7  —  sin^  7  4-  cos^  77  —  sin^  77}. 

Several  observations  can  readily  be  made  from  this  equation: 
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When  /?  =  0,  then 


(iX 

{A 


1. 


This  suggests  the  intuitively  pleasing  conclusion  that  when  coordination  is  non-existent 
the  coordinated  system’s  performance  is  identical  to  that  of  the  autonomous  system,  even 
when  the  coordinator  is  perfect. 


•  When 


and 


0</3<  1, 


.TT, 


0<7<l3i-), 


,7r. 


then,  from  the  door-stop  rule,  we  have 

A.  _ 


If) 

\  /a 


cos^  7  —  sin^  7  cos^  rj  —  sin^  t) 


n 

>  1  when  0  <  7, 77  <  — 

<  1  when  —  <  7, 77  <  — . 


Thus,  we  conclude  that  the  performance  of  the  coordinated  system  is  superior  to  that  of 
the  autonomous  system  for  all  0  <  /3  <  5  since 


^  _  /  TT  V  TT 

0  <7,77  <,0(-)  <  - 


If) 

\  /  c 

If) 

\  la 


>  1. 


This  suggests  that  “small”  to  “moderate”  levels  of  subsystem  coordination  enhance  the 
coordinated  system’s  performance  over  that  for  the  autonomous  system  when  the  imper¬ 
fections  in  the  two  subsystems  are  also  at  “small”  to  “moderate”  levels.  We  also  conclude 
that  the  performance  of  the  coordinated  system  is  exceptionally  superior  to  that  of  the 
autonomous  system  for  all  5  <  /?  <  1  since 

TT  \  TT 

4  <7.»<«2)<2=^ 


(A  <  “  (A = ' 
(A 


<0. 


Note  that  in  this  7,77  regime  the  performance  of  the  autonomous  system  is  poor  since 
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(f'j  <  0  while  that  of  the  coordinated  system  is  still  perfect  since  (t'^  =1.  This 
suggests  that  “significant”  levels  of  subsystem  coordination  can  be  used  to  enhance  the 
coordinated  system ’s  performance  over  that  of  the  autonomous  system  when  the  subsystem 
imperfections  are  also  “significant”  -  provided  that  the  coordinator  is  perfect. 


4.2.1. 6  A  Performance  Comparison  Of  The  Autonomous  And  Coordinated 
Systems  Using  Projections 


Clearly,  |+-r  +  +  ++)  and  |-r  +  +  +  H--r+)  are  the  most  desirable  of  all  possible  truth 
basis  states  for  the  autonomous  and  coordinated  systems,  respectively.  We  shall  refer  to  these 
states  as  the  exalted  states  for  these  systems.  When  these  systems  exist  only  (i.e.,  with  unit 
probability)  in  these  exalted  states  they  are  functioning  perfectly.  However,  when  imperfections 
exist  in  these  systems  (i.e.,  when  variable  truth  states  are  superpositions  of  |+)  and  |— )),  then 
the  truth  state  vectors  for  these  sj^tems  become  superpositions  of  other  truth  basis  states  as 
well.  In  this  case,  these  systems  possess  non-vanishing  probabilities  that  they  can  exist  in  other 
truth  basis  states  in  addition  to  the  exalted  states.  Thus,  the  more  probable  it  is  that  these 
systems  are  in  exalted  states,  the  better  their  performance  is  expected  to  be. 

Here,  we  will  use  the  expressions  for  pro6a(++++++)  and  proi)c(+  +  +  -r-f +  -f-)  developed 
above  from  the  associated  projections  to  evaluate  the  performance  of  these  systems  from  this 
perspective.  Let 

C  —  +  ^2, 

^  =  C  +  ^3, 

X  =  6/i  +  9$, 

K  =  A  -f-  ^6, 

and  consider  the  ratio 

^  pro6c(+  -b-h  -H  -b  +  -h) 

~  proba{+  -H  -I-  -H  -I— r) 

=  {cos^[C  -  /?(|  -  ^r)]  cos^[^  -  -  O7)]  • 

cos^[A  -  /?(^  -  ^7)]  cos2[k  -  -  9j)]  cos^  e^]/ 

{cos^  C,  cos^  t?  cos^  A  cos^  k}. 

The  following  observations  are  worthy  of  note: 

•  If  /3  =  0  (i.e.,  when  there  is  no  coordination  of  subsystem  plans),  then  we  conclude 

0  <  r  =  cos^  9^  <  1. 
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As  expected,  this  suggests  that  vrithout  coordination,  it  is  more  probable  for  the  au¬ 
tonomous  system  to  exist  in  its  exalted  state  than  it  is  for  the  coordinated  system  to 
exist  in  its  exalted  state.  However,  if  the  coordinator  is  perfect  (i.e.,  Oj  =  0^,  then  these 
probabilities  will  always  be  equal. 

•  When  0  <  /3  <  1,  07  =  0  (i.e.,  the  coordinator  is  perfect), 

0<i9</?(|), 


and 


0<k</3(|), 


then,  from  the  door-stop  rule,  we  have 


r  = 


> 


_ 1 _ 

cos^  cos^  cos^  A  cos^  k 

1  when  0  <  C,  0,  A,  K  <  — . 


(Clearly,  when  0  <  t?,k  <  /?(§),  then  0  <  C,A  <  /?(f)  because  0  <  03,^6)-  Thus,  we 
conclude  that  for  all  levels  of  coordination  between  the  extremes  of  no  coordination  and 
complete  coordination,  the  probability  for  the  coordinated  system  to  be  in  its  exalted  state 
is  greater  than  that  for  the  autonomous  system.  This  suggests  that  “very  small”  to  “nearly 
complete”  levels  of  coordination  can  be  used  to  increase  the  probability  of  existence  in  the 
exalted  state  for  the  coordinated  system  relative  to  that  for  the  autonomous  system  when 
the  subsystem  imperfections  are  “small”  to  “moderate”  in  magnitude  and  the  coordination 
is  perfect. 


4.2. 1.7  Changes  In  The  Performance  Of  The  Qutonomous  And  Coordinated 
Systems  Due  To  Variations  In  The  Target  Track  File  Uncertainty 


As  a  final  illustration,  we  will  examine  one  of  the  more  obvious  relationships  between  entropy 
and  system  performance.  In  this  model,  the  uncertainty  Ai4„  =  Ai4^  =  A14  quantifies  the 
quality  of  the  target  track  files  for  the  autonomous  and  coordinated  systems.  Let  us  assume 
that  the  prior  states  for  variables  2,  3,  5,  and  6  are  perfect  (i.e.,  02  =  03  =  ^5  =  06  =  0)  and  that 
each  subsystem’s  track  file  is  of  identical  quality,  i.e.  0i  =  04  =  (p.  Furthermore,  assume  that 
for  the  coordinated  system,  the  quality  of  the  “coordinator”  is  degraded  by  track  file  quality  so 
that  07  =  (p.  Then, 

cos^  (p  —  sin^  p 
cos  2p, 


(f)^  =  cos2[(l+^)^-/3(|)]-sin2[(l  +  ^)^-/3(^)] 
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=  cos[2(H-/3)i^-/37r], 

and  ^ 

Ai4  =  -)(cos^^lncos^(^-r  sin^  V^lnsin^  (p). 

in  2 

Hence,  as  then  (t^  -1,  — >  —1,  and  A14  ^  1  as  |  (the  track  file  becomes 

more  uncertain)  and  A14  — »  0  as  (/p  |  (the  track  file  becomes  more  certainly  of  poor  quality). 
This  suggests  the  obvious  conclusion  that  decreasing  track  file  quality  generally  degrades  the 
performance  of  both  the  autonomous  and  coordinated  systems. 

Now  observe  that  (t^  and  (t^  have  periods  tt  and  (y^))  respectively,  and  (t^  is  phase 
shifted  (towards  increasing  p)  relative  to  (t^  by  (y+ls)  means  that  for  small  (3  and  p 

values,  the  track  file  quality  is  still  very  good  and  >  {t^  .  This  suggests  that  for  “small” 
decreases  in  track  file  quality  “small”  levels  of  coordination  may  enhance  the  performance  of  the 
coordinated  system  over  that  of  the  autonomous  system.  However,  for  small  p  there  is  a  (small) 
for  which  (t'^  >  (t'^  (e.g.  for  =  0,  (t^  =  +1  and  =  cos[— /?7r]  <  +1).  This 
suggests  that  “over-coordination”  when  all  subsystems  have  a  high  quality  track  file  can  decrease 
the  performance  of  the  coordinated  system  relative  to  that  of  the  autonomous  sj/stem.Similarly, 
for  a  moderate  p  value,  there  is  a  moderate  /3  =  fio  for  which  (r'j  >  (^)  for  /?  <  (Sq  and 

(f'^  >  (f'j  for  P  >  Po  (®-g-  (^  =  15°  =>  Ai4  =  .3546,  Pq  =  0.4).  We  can  conclude  from 
this  that  moderate  coordination  of  a  system  using  a  track  file  of  moderately  degraded  quality  can 
increase  the  system ’s  performance.  Although  care  must  be  taken  when  drawing  conclusions  from 
such  an  analysis,  it  is  easy  to  see  that  the  period  and  phase  difference  between  (t'^  and  (t^ 
will  produce  conditions  which  yield  the  reasonable  conclusion  that  “over-coordination”  based 
upon  low  quality  track  file  information  can  yield  a  “precipitous”  reduction  in  the  performance 
of  the  coordinated  system. 


4.2.2  System  Sensitivity  Analyses 


4.2.2. 1  A  Static  Perturbation  Theory  For  Truth  States 


In  this  section  we  will  develop  a  first-order  static  (time-independent)  perturbation  theory 
for  truth  states.  Although  this  theory  is  general,  its  primary  utility  is  for  the  estimation  of  the 
changes  in  the  value  of  the  truth  state  and  expected  truth  values  for  nodes  induced  by  small 
changes  in  variable  truth  angles  and  influence  angles  for  systems  which  have  constant  influence 
angles  or  influence  angles  with  weak  functional  dependencies  upon  parent  truth  angles. 


Consider  first  the  effect  of  a  small  rotation  69 j  upon  a  truth  state  \6j): 

Risej)\ej)  =  \ej-69j) 

=  \9j) 

w  {lj  +  i-6ej-aj)\ej) 
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\ej)  -h  i  ■  66 j  •  cTj  \ej) . 


Here  we  use  a  “generic”  rotation  operator  R{66j)  to  initiate  our  development.  Now  let 


(4-1) 


SO  that  in  general 


+  =  \ey-6ei)\e2  -  662)---\en-69n) 

K,  (l^i)  +  i66iai  |^i))(|^2)  + 1662^2  1^2))  •  •  • 
{\9n)+i6en^n\Gn))- 


To  first-order  in  66 j  this  becomes 


~  \6i)\62)---\6n)+i^66ja,\6,)\4>^) 

j-l 

«  m  +  if266jaj\6j)\cj>j), 
i=i 

or 


1^^)  =  |4'  +  53')-l^)  (4-2) 

n 

J=1 

where  is  the  first-order  change  in  the  truth  state  \^)  due  to  the  small  rotations  60 j  and 

!</-,■)  =  101)1^2) 

Note  that, in  general,  each  child  66 j  can  be  dependent  upon  the  associated  perturbations  in  its 
parent  variables. 


Let  us  now  examine  how  a  small  change  in  a  truth  state  effects  the  expected  value  for  the 
associated  truth  operator.  Since  dj  is  Hermitean,  then  to  first-order  in  66j 

{6j  -  66j\fj  \6j  -  66j}  «  ((0,1  -  f  •  66j  ■  {6j\ <7, Of, (10,)  +  i  ■  50,-  •  d,- 10,)) 

~  (0,i  f ,'  1 0, )  +  i  •  66 j  •(0,1  [f  ,■ ,  (f ,]  1 0, ) 

«  (f,)-ri-50j-([f,-,dj]), 


or 

6{fj)  =  {6j-66j\fj\6^-66j)-{fj)  (4-3) 

~  i-66j-{[fj,&j]). 


Here,  6  (f,)  is  the  (first-order)  change  in  the  expected  truth  value  for  variable  j  due  to  the  small 
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rotation  66 j.  Recall  from  lemma  13  that  [fj,o'j]  =  tTj,  where  itj  is  the  permutation  operator 
for  variable  j.  Using  this  result  in  Equation(4-3)  yields 

6  (fj)  «  2  •  66 j  ■  (tTj)  . 


Application  of  lemma  14  to  this  expression  5delds 

6{fj)  «  2-66j-ATj 

«  4  •  66 j  ■  sin  6j  cos  6j 
S3  2-66j-sm26j. 

Observe  from  this  that  6  (fj)  is  a  weighted  truth  uncertainty  for  the  variable. 


Let  T  be  the  generalized  truth  operator  for  a  system  with  truth  state  vector  |'®')  given  by 
Equation(4-l).  Then  to  first-order 


m+{6^\)fm  +  m) 

('$'1  f  I'yp)  -f  (^1  f  t  I’j') 


so  that 


+  +  -($|f  l^') 

{^\f\6<i!)  +  {6^\t\^), 


where  6  ('5'|  T\^)  is  the  change  in  the  expected  value  for  the  general  truth  operator  produced 
by  a  change  |^^')  in  the  truth  state  |5').  To  evaluate  each  term  on  the  right  hand  side  of  the 
last  expression,  we  use  Equations (4-1)  and  (4-2)  to  obtain 


Therefore, 


{^\f\6^)-r{6^f\^) 


if2^6j{<l>j\{6j\[f,a,]  |0j)|.^j) 

J  =  1 

IC  .  , 

3=1 

|^50j(7rj) 

^ 

j-1 


6{^\f\^) 


l±66,.Ar, 

j=l 
9  ^ 

—  ^2  sin  20j . 

i=i 
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Note  from  this  that  (5  ('5'|  T  is  a  weighted  sum  of  logical  variable  truth  uncertainties. 


4.2.2. 2  An  Illustration:  The  Cruise  Missile  Search  And  Destroy  Problem 

Let  us  revisit  the  previous  example  which  showed  how  the  Dirac  method  can  be  applied 
to  the  cruise  missile  search  and  destroy  allocation  problem.  Recall  that  although  the  influence 
angles  for  this  system  are  functionally  dependent  upon  the  parent  variable  truth  angles,  this 
dependence  appears  in  the  argument  of  the  exponential  of  the  associated  sigmoid  scaling  factors 
7j.  These  scaling  factors  are  consequently  fairly  insensitive  to  small  changes  in  parent  truth 
angles  and  can  be  treated  as  strict  functions  of  the  associated  unperturbed  parent  truth  angles. 
Thus,  perturbation  theory  can  be  easily  employed  in  a  straightforward  manner  to  this  system 
for  the  purpose  of  sensitivity  analysis. 

Suppose  that  one  wishes  to  know  how  sensitive  the  overall  quahty  of  the  system  defined  by 
Figure  4.1-4  is  to  small  changes  in  the  influence  of  “cover  for  threat”  (variable  4).  This  can  be 
estimated  by  assuming  that  the  influence  angles  0:47  and  049  are  changed  by  the  amounts  89-j 
radians  and  69g  radians,  respectively.  Then,  using  the  posterior  values  for  dy  and  ^g,  we  find 
that 

6  (f  7)  «  2  •  60j  ■  sin  29 j 

w  2  •  69j  ■  sin  52'^ 

«  1.576 -(507, 

^  ('^9)  2  •  89g  ■  sin  2^9 

«  2  •  56>9  •  sin  90'^ 

«  2  •  69g, 

and 

6  {^\f\^)  ^  ^  [^^7  •  sin 52°  -r  69^  ■  sin 90°] 

«  ^  [0.788  •  697  +  599] . 

Thus,  positive  changes  in  the  influence  of  the  “cover  for  threat”  variable  upon  variables  7 
(“threat  in  region”)  and  9  (“threat  is  detectable”)  will  increase  the  expected  truth  values  for 
variables  7  and  9,  as  well  as  for  the  entire  system.  For  example,  if  697  =  699  =  5®  «  .0873 
radians,  then  6  (f  7)  «  +0.138,  6T9  «  +0.175,  and  ^  {’F|  T  |4')  «  +0.028. 

4.2.3  Data  Representation  and  Analysis 

In  this  section  we  will  illustrate  the  utility  of  the  Dirac  method  for  data  representation  and 
analysis  by  applying  it  to  two  suggestive  examples:  (i)  a  rotational  symmetry  analysis  of  a 
simple  digital  image;  and  (ii)  creating  a  fused  observational  error  profile  using  an  e  state. 
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4.2.3.1  Finding  Rotational  Symmetries  In  A  Simple  Digital  Image 
Consider  the  following  square  8x8  pixel  digital  image: 
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(4-4) 


and  partition  it  into  a  2  x  2  quartered  image  such  that  each  4  pixel  x4  pixel  quarter  image  is 
ordered  with  indices  1,2,3, 4  as  follows: 


1  2 
4  3 


(4-5) 


Here 
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(4-6) 


Cl]=[i]= 
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and  1  (on)  and  0  (off)  signify  binary  pixel  values. 

For  the  quarter  image,  let  the  probability  Pj,i(o)  that  a  pixel  is  on  (off)  in  the  quarter 
image  be  ^x[the  number  of  I’s  (O’)  in  the  quarter  image]  and  define  a  truth  state  \j)  for 
the  quarter  image  as 

ii)  =  +  ^/^\~)j  ■’j  ^  {1>2,3,4}, 

where  \  j)  represents  the  truth  state  for  the  assertion  “all  pixels  are  on  in  the  quarter  image.” 
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Using  these  definitions,  we  have 


ii)  =  -Y[l+)i-l-)i], 

|2)  =  |-)2, 

|3)  =  ^[K)3-M-)3L 

and 

i4>  =  |-)4, 

so  that  a  truth  state  |  for  the  quarted  image  can  be  represented  as  the  tensor  product 


1^)  =  |1)  12)13)14) 

=  ^0+ -+-)  +  !--—)  + 

- )]■ 


For  the  purpose  of  our  simple  symmetry  analysis,  suppose  we  ask:  “does  the  original  image 
(4-4)  have  a  ^,n  €  {1,2,3},  (clockwise)  rotational  symmetry  about  its  center  when  viewed 
as  a  (coarser  grained)  quartered  image?"  Observe  that  the  such  rotation  of  the  quartered 
image  with  order  defined  by  Figure  (4-5)  is  permutation  7r„  of  the  order  indices  in  Figure  (4-5). 
In  particular,  application  of  TTn  to  Figure  (4-5)  produces 


and 


4  1 
3  2 


when  n  =  1, 


3  4 
2  1 


when  n  =  2, 


2  3 
1  4 


when  n  =  3. 


Consequently,  each  rotation  written  in  Cayley  cycle  notation  is 


TTi  =  (1432), 


TTa  =  (13)(24), 

and 

7r3  =  (1234). 

These  permutations  can  be  defined  as  operators  TTn  {not  to  be  confused  with  the  permutation 
operator  itj  for  the  truth  state)  which  act  upon  l^f)  by  acting  upon  each  of  its  four  basis 

ket  vectors  by  permuting  the  order  of  their  ±’s  (e.g.,  tti  |4 - ! — )  =  j — ! - h) ,  etc)  so  that 

TTn  I'J')  =  I'S'n)  IS  the  truth  state  for  the  associated  rotated  quartered  image.  Thus,  the  expected 
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value 

{itn)  =  (5'|7r„l3') 

is  the  autocorrelation  function  which  quantifies  the  similarity  between  the  image  rep¬ 

resented  by  1^)  and  its  rotation  I'J'n)-  The  closer  the  value  of  (7r„)  is  to  1  (0),  then  the  greater 
(less)  the  similarity  -  or  correlation  -  between  the  images  represented  by  |^)  and  l^n)  and  the 
better  (worse)  the  associated  rotational  S3unmetry  (obviously  the  n  —  A  rotation  is  the  identity 
permutation  (1)(2)(3)(4)  so  that  (5^  |'J'4)  =  (5'  j'J')  =  1  and  the  correlation  is  perfect). 

Let  us  now  calculate  each  (tt^)  and  use  its  value  to  identify  the  rotation  which  provides  the 
best  correlation  between  the  quartered  image  and  its  associated  rotation.  For  n  =  1,  we  have: 

=  ^[^1  I+-+-) - )  + 

7ri|---r-)-b7ri| - )] 

=  - ^ - ^ 

!_  +  __)  +  ! - )] 

so  that 

(tTi)  =  (VPITTI  I'J') 


1 

4’ 


i.e.,  I’F)  and  |^'i)  have  only  1  (| - })  of  the  4  possible  basis  kets  in  common.  For  n  =  2, 

^2  1 5')  =  ^[^2|+-+-)+7r2|H - )  + 

7r2| - H-)  -t-7r2| - )] 

=  2[l"* - ^ — )  +  l - ^ — )  + 

!+___)  +  ! - )] 

so  that 

(^2)  =  ^[(+ - +-!+-+-)  + (+ 

4 
4 

=  1, 

i.e.  1 5')  and  |^'2)  have  all  4  basis  kets  in  common  and  the  correlation  is  perfect.  Similarly,  for 


4-27 


NSWCDD/TR-01/103 


n  =  3, 

^3|^)  =  ^[|---+)-|-  +  — )- 

I - +  I - )] 

and 

(n)  =  j( - 1 - ) 

1 

4’ 

i.e.  1$)  and  l^s)  have  only  the  basis  ket  | - )  in  common. 


Since  (^2)  =  1,  we  conclude  that  since  the  quartered  image  exhibits  a  perfect  (^'l  ^'2)  cor¬ 
relation,  then  the  image  (4-4)  possesses  a  coarse  grained  7r2  (i.e.  a  180°  rotational)  symmetry. 
However,  it  is  easy  to  see  that  the  original  image  (4-4)  does  not  possess  an  exact  (non-coarse 
grained)  180°  rotational  symmetry  because  the  1®‘  3’"'^  quarters  in  the  rotated  original  image 
are  , _ , 
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which  are  -  themselves  -  180°  rotations  of  the  1®‘  and  3’''^  quarters  of  the  unrotated  original 
image. 


We  can  use  the  same  quartering  approach  to  find  coarse  grained  symmetries  in  each  quarter 
of  image  (4-4),  as  well  as  in  each  quarter  of  each  of  these  quarters.  Let’s  use  the  1®^  quarter  of 
image  (4-4)  given  by  subimage  (4-6)  to  illustrate  this.  Using  the  same  ordering  scheme  as  (4-5) 
we  have  for  this  subimage 


and 


[n=iz]= 


[i]=[±]= 
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1 

1 

1 

1 

Then  the  truth  state  |4>)  for  this  subimage  is  the  single  ket 
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so  that 


(TTi)  =  ($|7ri|$) 

=  (--i - r|7ri|-T - h) 

=  0, 

(7r2)  =  ($1 7r2 1$) 

=  (-H - l-|7r2|-H - r) 

=  (-  +  -+!-  +  --) 

=  1, 

and 

(^3)  =  ($1  ^3 1$) 

=  - - ^-) 

=  (-  +  --K-+-) 

=  0. 

We  conclude  from  this  that  the  1®*  quarter  of  image  (4.-4)  also  exhibits  a  coarse-grained  it 2 
symmetry.  By  observation,  it  is  easy  to  see  that  the  3’’'^  quarter  of  this  image  also  exhibits  a 
coarse-grained  it 2  symmetry  and  that  the  2”'*  and  4**  quarters  exhibit  all  coarse-grained  tti, 
it 2,  and  #3  symmetries. 

Let  us  now  examine  the  non-coarse-grained  pixel  level  symmetries  for  each  quarter  of  the 
1®*  subimage  quarter.  Using  again  the  order  defined  by  (4-5),  it  is  easy  to  see  that  the  truth 
states  for  each  subimage  quarter  1  —  4  are 

|1)  =  1 - ), 

|2)  =  |-r  -i-  d-r)  , 

|3)  =  I - >, 

and 

|4)  =  !+  +  ++). 

It  is  easy  to  see  that  for  this  subimage 

{k\  itj  \k)  =  1,  j  G  {1, 2, 3}  ,  fc  €  {1, 2, 3, 4}  . 

In  a  similar  manner,  it  can  be  readily  shown  that  the  same  value  for  {kj),j  G  {1,2,3},  is 
obtained  for  each  quarter  of  the  remaining  three  subimage  quarters  of  (4-4) .  We  conclude  from 
this  that  each  quarter  of  every  subimage  quarter  of  (4-4)  exhibits  pixel  level  itj,j  G  {1,2,3} , 
symmetries. 
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These  results  are  summarized  in  the  rotational  symmetry  quad-tree  of  Figure  4.2-1  which 
represents  the  ^,n  €  {1,2,3}  ,  rotational  symmetries  exhibited  by  image  (4-4)  when  examined 
from  different  levels  of  image  refinement.  Here,  the  root  vertex  of  the  tree  represents  the  original 
image  and  the  label  “2”  indicates  that  when  -  quartered  and  rotated  about  its  center  -  it  exhibits 
an  associated  coarse  symmetry.  The  leaf  vertices  in  the  tree  represent  the  finest  pixel  level 
quartering  resolution  possible  for  the  image  -  each  vertex  represents  such  a  quartering  -  and 
the  labels  1,2,3  on  each  of  them  indicates  that  each  such  quartering  exhibits  all  7ri,7r2,  and 
TTa  (exact)  symmetries  when  rotated  about  their  centers.  The  four  middle  layer  vertices  in  the 
tree  represent  quarterings  of  each  quarter  in  the  original  image  and  their  labels  correspond  to 
their  associated  rotational  symmetries.  Edges  in  the  tree  connect  quarterings  of  quarters  and 
for  each  such  quartering  the  left- most  vertex  represents  quarter  1  as  defined  by  (4-5) ,  the  next 
left-most  represents  quarter  2,  etc.  An  analysis  of  this  symmetry  quad-tree  suggests  that  the 
image  consists  of  diagonal  arrangements  of  squares. 


Figure  4.2-1.  A  Rotational  Symmetry  Quad-Tree 


4. 2. 3. 2  Observational  Error  Fusion  Using  An  g  State 

Let  M  be  a  surveillance  instrument  which  c£in  measure  two  signatures  si  or  S2  that  can  be 
used  to  distinguish  two  categories  of  flying  aircraft.  The  measurement  errors  associated  with 
each  signature  are  normally  distributed  with  mean  values  si  and  S2  and  standard  deviations  ai 
and  a2,  respectively,  such  that  5i  <  52,  »  (T2,  and 

|si  -S2I  <  min  {(71,0-2}. 

This  condition  ob-viously  indicates  that  serious  aircraft  classification  difficulties  can  arise  when 
measurements  from  M  are  used  for  this  purpose.  Fortunately,  these  signature  measurements 
are  sensitive  to  the  elevation  0  at  which  they  are  performed.  Only  the  S2  signature  can  be 
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observed  when  the  aircraft  is  observed  at  M’s  zenith  and  only  the  si  signature  can  be  observed 
when  the  aircraft  is  fljdng  at  very  low  elevations  near  M’s  local  horizon.  It  is  also  known  that 
Si  signatures  are  increasingly  more  likely  to  be  observed  than  S2  measurements  as  0  — >  0®  and 
that  it  is  equally  likely  for  si  or  S2  to  be  observed  if  the  measurement  is  made  at  local  elevations 
near  45°. 

This  behavior  of  the  observational  errors  for  M  can  be  modelled  using  the  system  ket 

1$)  =  .Ri3(ai3)43(a23)|0i>|02)|03;t^’3^’*^)  (4-7) 

=  l^l)  1^2)  1^3  —  (Q:13  -t-  ^23))  , 

where  61  is  the  current  observational  elevation,  82  is  the  elevation  of  the  previous  observation, 
Q!i3  =  —81,  0:23  =  82,  and 

=  COS6I3  |-h)3  ItAa')  +sin6»3  |-)3  . 

The  prior  values  for  the  truth  angles  are  81  =  82  =  85  =  0  and  the  observation  elevations  81 
and  82  are  assumed  to  be  available  and  frequently  updated.  The  associated  error  profile  (i.e. 
the  probability  distribution  for  measuring  si  or  $2)  is 

|(V?3  =  cos2/3|((^3  |•^/»^l}|2  +  sin2/3|((^3  , 

where  /?  =  ^3  +  (^i  -  6*2)  and  |(¥J3  |^3^)P  and  |(9?3  are  the  Gaussian  observational  error 

profiles  for  Si  and  S2,  respectively. 

To  see  how  changes  with  elevation  angle,  let  us  iterate  it  through  several  observational 
cycles.  Denoting  the  state  and  elevations  for  the  iteration  by  |5'fc)  and  9^  we  have: 

1^1)  = 

|(</73|T)i^  =  cos2  6I}  |(</73  IV’aOl^  +  sin^e];  |(VJ3 

K</^3  1^2)1^  =  COS^^'?|(V’3  IV'3">I^ 


|(</73  |vEr)|2  =  cos2  6>^  |(v?3  I^^i)p  +  sin2  0^-|(v?3 

A  schematic  showing  the  evolution  of  several  representative  consecutive  fused  observational 
error  profiles  is  shown  in  Figure  4.2-2. 

Before  closing  this  section  we  note  that  if  the  rate  of  change  8  of  the  observational  elevation 
is  known,  then  the  time  evolution  operator  can  be  used  to  specify  the  observational  error  profile 
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state.  In  this  case,  the  state  is  time  dependent  and  at  any  time  t  is  given  by 

mt))  = 

=  eo~ 

where  Oq  is  the  elevation  of  the  first  observation  made  at  time  to-  If  ^  is  constant,  then 


o<e<7T/4 


s,  s. 


Figure  4.2-2.  A  Schematic  Of  The  Evolution  Of  A  Fused  Observation  Error  Profile 
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5  OPTICAL  COMPONENT  DEVICE  IMPLEMENTATIONS 


In  this  section  we  will  briefly  provide  “thumbnail  sketches”  of  simple  optical  devices  which: 

1.  Produce  e  states  for  a  logical  variable; 

2.  Rotate  child  variable  truth  states;  and 

3.  Measure  the  expected  truth  value  and  probability  distribution  for  a  logical  variable’s  e 
state. 

It  is  not  our  intention  to  suggest  that  these  simple  devices  be  used  for  Dirac  network  ap- 
phcation.  Rather,  we  merely  wish  to  illustrate  that  useful  optical  implementations  of  Dirac 
networks  are  possible.  Practical  general-purpose  implementations  would  use  state-of-the-art 
optical  components  -  along  with  user/computer  control  capabilities  which  would  provide  the 
necessary  flexibility  to  reconfigure  the  network  in  order  to  adapt  it  to  a  wide  variety  of  appli¬ 
cation  domains. 

It  should  be  pointed  out  that  even  though  the  theory  developed  above  is  based  upon  quan¬ 
tum  mechanics,  it  is  also  valid  within  our  context  when  classically  intense  (laser)  light  is  used. 
Consequently,  the  properties  which  we  discuss  for  each  of  these  devices  are  valid  at  both  the 
quantum  level  (i.e.,  for  an  individual  photon)  and  the  classical  level  (i.e.  for  large  ensembles  of 
identically  prepared  photons).  In  what  follows,  we  shall  assume  that  our  light  source  is  a  laser 
which  produces  a  classically  intense,  randomly  polarized  beam  which  exhibits  a  Gaussian  flux 
density  cross  section;  and  each  device  is  oriented  within  the  same  laboratory  reference  frame 
(as  defined  on  accompanying  figures). 

5.1  OPTICAL  e  STATES  FOR  LOGICAL  VARIABLES 

An  optical  z  state  for  a  logical  variable  j  can  be  easily  produced  using  a  laser,  a  polarization 
filter,  and  a  specially  cut  bi-refringent  prism  (the  prism  has  one  facet  cut  at  a  small  angle) .  The 
associated  expected  truth  value  and  probability  distribution  can  be  measured  using  a  slit  screen 
and  a  CCD  photon  detector.  This  apparatus  and  its  arrangement  relative  to  the  laboratory 
reference  frame  is  shown  in  Figure  5.1-1.,  where:  (i)  the  laser  beam's  center  is  aligned  to  lie 
precisely  along  the  z-axis  of  the  frame;  (ii)  the  facets  of  the  prism  are  oriented  as  shown  in  the 
figure  with  the  facet  nearest  the  laser  normal  to  the  beam;  (iii)  the  beam  covers  a  narrow  slit 
oriented  along  the  y-axis  in  the  screen;  and  (iv)  the  CCD  detector  is  centered  in  the  beam  and  is 
oriented  to  sample  the  hght  intensity  along  the  x-axis.  The  linear  polarization  state  of  a  beam 
photon  produced  by  the  polarization  filter  defines  the  truth  state  for  variable  j.  The  natural 
Gaussian  flux  density  profile  of  the  beam  provides  a  spatial  probability  distribution  for  finding 
a  photon  along  the  x-axis  of  the  beam.  The  prism  serves  as  an  optical  Stern-Gerlach  apparatus 
and  entangles  the  polarization  states  with  polarization  dependent  probability  amplitudes.  The 
slit  in  the  screen  serves  to  disperse  photons  along  the  x-axis  in  order  to  produce  a  more  easily 
observed  probability  distribution. 
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After  passing  through  the  polarization  filter,  the  tt  state  for  variable  j  is 

where  is  the  probability  amplitude  for  finding  a  photon  along  the  x-axis  and  \6j)  is  the 
polarization  state  for  a  photon  in  the  laboratory  reference  frame  given  by 


\0j)  =  cos  dj  lx)  -r  sin  Oj  \y) . 


Here,  |x)  and  \y)  are  unit  vectors  along  the  system’s  x-  and  y-axes  and  represent  the  truth  kets 
and  |— )j-,  respectively.  The  angle  6j  is  the  angular  setting  of  the  polarization  filter  measured 
clockwise  (when  looking  along  the  z-axis  from  the  laser  to  the  detector)  from  the  positive  x- 
axis.  After  passing  through  the  prism,  a  photon  is  produced  which  is  in  the  polarization-position 
amplitude  entangled  e  state 

=  cos0j  |x)  -i-  sindj  \y)  . 

The  photon  intensity  measurement  by  the  CCD  detector  produces  the  probability  distribution 
profile  (along  the  x-axis)  given  by 

|(x  =  cos^ej\{x  |V’j}|Vsin2  6'j  |(a:  • 

The  system  is  calibrated  by  finding  the  CCD  pixel  numbers  noj  that  locate  the  peaks  of  the 
6j  =  0,  9j  —  and  9j  =  |  intensity  profiles.  These  correspond  to  the  pixel  numbers  for 
expected  truth  values  -f-l,  0,  and  -1,  respectively  (it  is  assumed  that  pixel  numbers  are  counting 
numbers  and  they  increase  in  the  positive  x  direction  such  that  no  >  >  nz).  If  Ua  is  the 

pixel  number  for  the  peak  intensity  for  an  arbitrary  filter  setting  0  <  9j  —  a  <  ^,  then  the 
expected  truth  value  for  the  variable  can  be  calculated  using 


{rj)  = 


no  -  ns 


A  9'j'^  post-selected  e  state  can  be  produced  using  the 
5.1-1  with  an  additional  post-selection  polarization  filter 
the  beam  between  the  prism  and  the  slit.  The  resulting 
Equation(3-9)  with  ipj  replaced  by  x. 


same  apparatus  as  shown  in  Figure 
with  angular  setting  9j  inserted  in 
photon  intensity  profile  is  given  by 
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X 


z 


Reference  Frame  (positive  y-axis  is  perpendicular  to  paper  and 
in  accordance  with  right  hand  screw  rule). 


Laser  Filter  Prism  Slit  Detector 

Figure  5.1-1.  An  Apparatus  For  Creating  And  Measuring  An  s  State 


5.2  TIME-DEPENDENT  OPTICAL  INFLUENCES  OF  A  PARENT  UPON  ITS  CHILDREN 

Consider  an  n  variable  system  in  which  the  time  dependent  variable  1  is  the  only  parent 
for  child  variables  2, 3,  •  •  • ,  n  and  assume  that  simultaneous  time  dependent  post-state  expected 
truth  value  profiles  {fj{t)),  t  >  to,  are  required  for  variables  j  =  2, 3,  •  •  • ,  n.  The  time-dependent 
truth  state  for  such  a  system  is  given  by  the  specification 

=  Ul{t,to;Ol)Rl2{o^l2)Rl3{oilz)  ■  ■  ■  Rln{ctln)  |^l)  1^2)  '  '  ’  l^n)  , 
where  -  for  the  sake  of  simplicity  -  we  will  assume  that  aij  =  —9i  ^  with  'Jij  a  constant. 

An  optical  realization  of  this  system  can  be  achieved  using  a  classically  intense  randomly 
polarized  laser  as  a  photon  source;  a  polarization  filter  with  computer  (programmed  with  the 
dynamics  defined  by  0i)  controlled  time  stepped  setting  di  which  defines  Ui{t,to\  9i)  |^i);  a  1- 
input  port/(n  -  l)-output  port  polarization  preserving  beam  splitter;  n-1  optically  active  light 
guides  tuned  to  perform  the  specified  n  —  1  rotations  Rij[a.ij)-,  and  n  —  1  optically  active  light 
guides  which  induce  the  specified  prior  states  \0j),  j  =  2, 3,  •  •  •  ,n  into  |'S'(t)).  Measurement  of 
the  required  expected  truth  value  time  profile  for  the  child  variable  is  performed  using 

a  computer-controlled  rotation  angle  stepped  polarization  filter  which  operates  in  concert  with 
a  photon-intensity  counter  in  order  to  rapidly  determine  and  record  the  child’s  time-dependent 
filter  setting  which  produces  the  maximum  intensity  at  time  t.  This  setting  corresponds  to 

the  time-dependent  post-state  1^(0)  which  is  used  to  compute  and  record  the  time-dependent 
expected  truth  value  profile  according  to 

{fj{t))  =  cos^  -  sin^  d'j{t). 
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Note  that  this  measurement  apparatus  seeks  a  filter  setting  such  that  the  autocorrelation 


function  (&■ 


e>)  =  1,  i.e. 


O'j.  A  schematic  of  this  apparatus  is  shown  in  Figure  5.1-2. 


Let  us  examine  in  more  detail  how  the  optical  activity  of  the  light  guides  produce  a  rotation 
of  the  child  prior  state  \9j).  The  optically  active  medium  of  the  light  guide  connecting  variable 
1  (input  port)  with  that  of  variable  j  is  characterized  by  a  specific  rotatory  power  6ij  which 
is  the  measure  of  the  change  of  the  input  truth  state  polarization  angle  6i  per  unit  length. 
Consequently,  the  associated  rotation  angle  produced  by  a  light  guide  of  length  Cij  depends 
upon  9i  and  is  given  by  the  linear  expression 


Pu  =  -aii  =  ^1  ±  Sijiij, 


where  -I-  (— )  applies  when  the  optically  active  medium  is  dextrorotatory  (levorotatory) .  If  6j 
is  the  specific  rotatory  power  for  the  optically  active  medium  used  to  produce  the  prior  child 
state  \9j),  then  the  length  £j  of  dextrorotatory  medium  required  to  rotate  an  input  polarization 
vector  through  a  positive  angle  9j  is 

h  c  i 
Oj 

so  that  the  polarization  vector  entering  the  \9j)  child  state  medium  at  angle  leaves  it  after 
traversing  length  £j  at  angle  9j  +  Pi  j-  Therefore, 

\^j)  ~ 

=  \9j  ■^9i±6ij£ij) 

=  \9i+dj£j±6ij£ij) . 

As  an  illustration  of  this,  assume  that  the  laser  produces  6500  wavelength  randomly  po¬ 
larized  light  and  that  the  fight  guide  medium  is  quatrz  which  has  a  specific  rotatory  power 
—  8ij  —  /mm  at  6500  wavelength.  Suppose  that  the  prior  truth  state  for  child  variable  j 
is  |34®y  Then  the  length  of  the  levorotatory  light  guide  medium  needed  to  produce  this  prior 
truth  state  for  variable  j  is 

34° 

- =  2.0  mm. 

^  17’^/mm 

If  the  required  influence  aij  of  variable  1  upon  variable  j  is 

-ai,-  =  /3i^.  =  9i-  8.5°, 

then  the  length  of  levrorotatory  quartz  fight  guide  needed  to  implement  this  linear  relationship 
is 


hj  = 


8.5” 


n^/mm 

Consequently,  post-state  of  variable  j  is  given  by 


=  0.5  mm. 
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For  example,  if  |0i)  =  |lOi),  then  0'-^  =  |35.5°y  This  example  is  shown  in  Figure  5.2  -  2 

where  we  follow  the  linear  polarization  (electric  field)  vector  E  of  the  light  as  it  traverses  the 
system.  There  the  parent  state  |0i)  =  |105)  is  produced  by  a  linear  polarizer  with  angle  setting 
6i  =  10°  relative  to  the  laboratory  reference  frame’s  x-axis.  Light  leaves  the  polarizer  and 


enters  the  light  guide  with  E  such  that  =  arccos 


{A 


JL 

|yl 


=  10°.  As  the  light  traverses  the 


Ij  light  guide,  E  is  rotated  counter-clockwise  (looking  from  the  polarizer  down  the  light  guide) 
and  emerges  after  0.5  mm  with  E  making  an  angle  1.5°  with  the  laboratory  reference  frame’s 
positive  x-axis.  It  then  enters  variable  fs  dextrorotatory  medium  with  this  E  orientation  and 
E  is  rotated  clockwise  as  it  traverses  this  light  guide.  The  light  leaves  variable  fs  light  guide  - 
having  been  rotated  34°  (clockwise)  -  with  E  making  a  final  angle  of  0'  =  35.5°  with  the  x-axis. 


Before  closing,  we  note  that  a  similar  approach  can  be  employed  for  the  case  where  the  roles 
of  parent  and  child  in  Figure  9  are  interchanged  and  there  are  n  —  1  parents  (variables  2  thru 
n)  influencing  a  single  child  (variable  1).  In  this  case,  however,  extra  care  must  be  exercised 
since  the  polarization  states  which  exit  the  jl  light  guides  must  be  (vectorially)  added  along 
with  that  of  the  prior  child  state. 


Figure  5.2-1.  An  Apparatus  For  Creating  And  Measuring  The  Influence  Of  A  Time 
Dependent  Parent  Variable  Upon  Its  Children 
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Figure  5.2-2.  An  Example  Of  Polarization  State  Rotations 
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6  CONCLUDING  REMARKS 


This  report  describes  the  research  and  development  products  generated  during  FYOO  -  FYOl 
that  involve  application  of  the  Dirac  algebra  of  quantum  mechanics  to  methods  of  probabilistic 
inference.  This  work  is  viewed  by  the  authors  of  this  report  as  a  successful  first  step  towards 
the  creation  of  a  viable  and  extremely  useful  approach  for  automatic  probabilistic  reasoning, 
aspects  of  systems  analysis,  and  data  representation  and  analysis.  Additional  related  areas  that 
merit  further  investigation  include  the  following; 

•  the  use  of  complex  valued  probability  amphtudes; 

•  using  quantum  interference  to  increase  the  probabilities  for  “good  answers”  and  decrease 
the  probabilities  for  “bad  answers;” 

•  using  general  angular  momentum  states  for  multi-valued  logic  problems; 

•  the  development  of  a  user-friendly  interface  for  a  DIRACNET  system; 

•  the  development  of  inter-system  interfaces  for  DIRACNET  in  order  to  enable  informa¬ 
tional  updates  for  special  application  areas; 

•  the  development  of  a  complete  optical  implementation  theory;  and 

•  using  DIRACNET  to  solve  additional  and  more  complex  applications  (e.g.,  data  mining, 
data  correlation  and  fusion,  and  temporal  reasoning) . 
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Appendix  A 

AN  OUTLINE  OF  CODE  FOR  THE  EXAMPLES 


A-l/A-2 
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The  purpose  of  the  code  was  to  test  the  behavior  of  Dirac  nets  on  a  group  of  problems  that  were 
simple,  but  large  enough  to  be  too  tedious  to  do  by  hand.  Some  assumptions  were  made  in  the 
interest  of  short  run  times.  One  was  to  avoid  calculating  sines  and  cosines  repeatedly.  The  angle  of  a 
truth  state  vector,  0,  and  the  angle  representing  a  ro^on,  a,  were  restricted  to  integer  values  and  the 
sines  and  cosines  for  all  integer  angles  beteen  0*  and  90'  were  stored  in  a  table.  Code  was  written  in 
ANSI  C.  Several  versions  of  the  program  exist,  usually  to  acommodate  different  methods  for 
deriving  the  a's.  The  code  fragments  here  are  meant  to  display  the  methods  used  without  the  burden 
of  reading  declarations,  allocations  and  input/output  processing  not  relevant  to  the  method  being 
discussed. 

A  list  structure  and  a  pointer  to  it  were  used  to  store  the  rotation  angle  information: 

cypedef  struct  list  *Llst_ptc; 
struct  list 
{  int  influencer; 
int  alpha; 

List_ptr  next; 

)  List; 

The  prior  and  posterior  values  of  9  were  stored  in: 

int  *cheta,  *theta_posC; 

The  sines  and  cosines  of  angles  from  0*  to  90'  were  stored  in: 

double  ‘sc; 

Note  that  the  first  half  of  the  table  held  sines  and  the  second  half  held  cosines. 

First,  information  relevant  to  a  node  was  read  in.  This  included  its  prior  0,  an  indicator  of  how  the  a 
values  were  to  be  computed  or  modified,  the  influencing  nodes  and  the  initial  a’s  (if  appropriate). 
The  a's  were  computed  in  a  variety  of  ways,  and  some  of  the  alternatives  will  be  reviewed  below. 
Once  the  a's  were  computed,  the  posterior  0's  were  calculated  in  a  routine  called  update_angles; 

void  update_angles  (int  ‘theta,  int  ‘thetajiost,  int  node,  List_ptr  ‘head_infl_list) 

{  int  angle; 

List_ptr  x; 
int  j; 

j  =  node; 

{  angle  =  * (theta  +  j) ; 

X  =  *{head_infl_list  +  j); 
vtile  (X  !=  NIL) 

{  angle  =  angle  -  x->alpha; 

X  =  x->ne)C:; 

) 

if-  (angle  >  90)  angle  =  90; 
if  (angle  <  0)  angle  =  0; 

•(  theta jjost  +  j)  =  angle; 
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) 

) 

As  the  code  indicates,  each  influencing  parent  node  simply  contributes  an  a  in  an  additive  fashion  to 
the  total  influence  on  the  child  node,  subject  to  the  doorstop  rule  that  contrains  0  to  lie  between  0' 
and  90*. 


The  expected  tnith  values  for  both  the  prior  and  posterior  values  of  the  0's  are  computed  using  the 
prior_truth  and  post_mith  routines: 

void  prior_tnith  (int  no_nodes,  int  *theta,  double  *sc,  inc  no_angles,  FILE  *outfile) 

{ 

double  expec_tnith,- 

double  s,  c;  /*  Note:  sin  is  at  i  and  cos  is  at  i+no_angles  in  sc  */ 

int  i; 
int  t; 


) 


for  (i  =  0;  i  <  no_nodes;  i+t-) 

{  t  -  * (theta  +  i) ; 
e  =  *  (sc  +  t) 

c  =  *(sc  +  t  no_angles); 
expec_truth  =  c*c  -  s*s; 

fprintf  (outfile,-  theta(%2d)  =  t2d  prior  truth  val  = 
i+1, t,  expec_truth) ; 

) 


void  post_ttuth  (int  no_nodes,  int  *theta_post.  doiible  ‘sc,  int  no_angles,  FILE  *outfile) 
double  expec_truth; 

double  s,  c;  /*  Note:  sin  is  at  i  and  cos  is  at  i+no_angles  in  sc  */ 

int  i; 
int  t; 


) 


fprintf  (out file, ’in*)  ; 

for  (1  =  0;  i  <  no_nodes;  ir+) 

{  t  =  •(theta_post  +  1); 
s  =  *(sc  +  t); 
c  =  *(sc  +  t  +  no_angles); 
expec_tnith  =  c*c  -  s*s; 
fprintf  (outfile,'  theta[%2d)  =  t2d 
i+1, t, expec_truth) ; 

) 


post  truth  val  =  %f\n*. 


Both  these  routines  write  their  results  to  a  formatted  output  file  for  the  analyst. 


Since  it  is  possible  to  examine  different  combinations  of  nodes  to  get  measures  of  interest,  some 
method  of  tracking  the  nodes  of  interest  and  the  corresponding  states  (true  or  false)  of  interest  was 
required.  The  method  chosen  used  two  words,  an  integer  and  an  unsigned  integer,  to  do  the  tracking. 
If  we  use  a  convention  where  we  label  each  bit  from  1  to  32,  starting  at  the  right  or  low-end  bit,  then 
each  bit  can  correspond  to  a  node.  (For  the  experiments  we  were  rumiing,  32  nodes  were  adequate, 
but  if  additional  nodes  were  needed  the  scheme  could  be  extended  to  multiple  words.)  The  variable 
maskin  is  used  to  track  which  variables  are  of  interest  and  the  variable  interest  tracks  whether  the 
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true  or  false  state  of  the  particular  variable  is  of  interest  The  user  provkies  a  list  of  variables,  each 
followed  by  T  or  F,  to  indicate  the  variables  and  states  of  interest  The  following  code  sets  up  the 


appropriate  masks: 

/*  Read  the  number  of  nodes  vhose  state  is  a  state  of  interset  */ 

fscanf  (infile, •%d',&no_of_Jj\terest>; 

node_of__interest  =  (int  *)  calloc(no_of_interest,si2eof (int)); 

/*  Based  on  the  number  of  nodes  and  nodes  of  interest,  set  up  a  mask  for  use  in  ket_prod  */ 

/*  Note  that  a  node  of  interest  can  be  of  interest  when  it  is  in  either  the  T  of  P  state.  */ 

/*  Tliis  is  indicated  hy  the  character  (T  or  F)  followoing  the  node  number.  */ 

nask  =  (unsigned  int  *)  calloc{no_nodes,  sizeof  (unsigned  int)); 
for  (i  =  0;  i  <  nojiodes;  i4-f) 

♦(mask  i)  =  pow(2,i); 


fprintf  (outflle, 'Xn  The  nodes  of  interest  and  their  state  of  interest  are:\n"); 

interest  =  0;  maskin  s  O; 

for  (i  =  0?  i  <  no_of^interest;  i+^l 

(  fsccinf  (infile,  "td  %ls*,  anode,  &char_tf); 

fprintf  (outfile, "  %d  %c\n',  node,  char_tf[0J); 

*  (node_of^interesC  +•  i)  *  node; 
temp  =  pow(2, (node  -  1) ) ; 
maskin  s  maskin  I  t^ip; 
if  (char_tf(0]  =  'T') 
interest  «  interest  |  temp; 

} 

no^states  =  pow(  (double) 2,  (double)  (no_nodes  -  no_o£_lnterest) ) ; 
max_stat€s  *  pow(  (double)2,  (double)no_nodes) ; 

Depending  on  the  study  being  conducted,  some  of  this  code  could  be  eliminated  or  simply  hard¬ 
wired.  if  that  proves  more  convenient. 


Once  the  nodes  and  states  of  interest  are  established,  the  ket_prod  routine  is  called.  This  computes 
the  probability  amplitude  for  the  combination  of  nodes/states  of  interest.  If  the  user  is  interested  in 
those  states  where,  for  example,  two  nodes  are  both  true,  this  routine  (with  the  above  initialization) 
can  provide  the  result  See  Section  n  C.  Note  that  there  is  also  a  routine  called  all_prod  which 
computes  die  probability  amplitudes  for  all  states.  This  can  be  of  interest  for  small  nets,  but  since 
the  number  of  states  goes  as  2N ,  the  volume  of  output  can  rapidly  become  too  large  to  provide  much 
in»ght 


void  ket_prod  (int  ‘thetajost,  int  no_podes,  int  iiiax_states,  int  interest,  unsigned  int  *nask, 
double  *prod_sum,  int  inas)d.n,  int  no  angles,  double  *sc) 

{ 

int  angle; 
int  i,  j; 
int  offset; 
int  temp,  teBp2; 
int  state_val; 
double  prod; 


♦prod^um  «  0.; 

for  (i  *  0;  'i  <  maiq_states;  i+'»') 
{  tenp  s  maskin  &  i; 
if  itsss^  ==  interest) 

C  state_val  =  i; 
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prod  =  1 . : 

for  (3  =  0;  j  <  no_nodes;  j++) 

{  angle  =  *(theta_post  +  j); 
temp2  =  ‘(mask  +  j)  4  state_val: 
if  (t€5ip2  ==  0) 
offset  =  angle; 
else 

offset  =  angle  +  no_angles; 
prod  =  prod  *  (*(sc  <•  offset)  ),- 

) 

*prod_sum  =  *prod_suffl  +  prod*prod; 

) 

} 

) 


Since  the  von  Neumann  entropy  is  available  (see  Section  II H)  for  both  individual  nodes  and  groups 
of  nodes,  these  calculations  are  performed  in  entropy_caIcl  and  entropy_calc2.  The  entropy  ratio 
(the  uncertainty  associated  with  the  group  of  nodes  of  interest  relative  to  the  entropy  of  the  wdiole 
system)  is  the  measure  of  interest,  rater  than  the  entropy  itself,  so  the  ratio  is  the  result  returned. 
The  code  for  node  groups  follows.  A  similar  routine  does  single  nodes. 

void  entroEy_calc2  (double  ‘entropy,  int  *node_of_interest,  int  no_of_interest, 
double  *entropy_ratio) 

{  int  i; 

int  count,  loc; 
double  sum; 

sum  =  0 .  ; 
count  =  0; 

for  (i  =  0;  i  <  no_of_interest ;  i++) 

(  loc  =  * (node_of_interest  +  i)  ; 
sum  =  sum  i-  *  (entropy  *  loc  -  1) ; 
count+t; 

) 

*  (entropy_tatio)  =  sum/ (count*m2) ; 

Several  different  approaches  were  used  to  set  up  the  rotation  angles,  a.  One  was  to  simply  read  in 
an  estimated  a  and  use  it  directly.  In  the  probability  models,  there  was  a  two-way  conversion 
between  probabilities  and  angular  settings.  The  code  for  this  version  of  program  is: 


void  func6  (List_ptr  *head_jnf l_list ,  int  ‘save,  int  node,  int  icount,  int‘  theta, 
int  ‘theta_post) 

{  int  gi,  hi; 
double  gf,  hf; 

List_ptr  X,  y; 
int  j; 

/‘  If  the  form  of  influence  is  6,  a  list  consisting  of  one  or  more  integers  follows.  The  ‘/ 
/‘  integers  are  the  influencing  nodes  4  the  +  and  -  influencing  factors,  entered  as  integers. ‘/ 
/‘  The  overall  node  ordering  is  inportant  since  it  is  the  updated  angles  that  influence  later‘/ 
/‘  nodes.  ‘/ 

X  =  lallocO ; 

‘ (head_infl_list  +  node)  =  x; 

x->influencer  =  ‘ (save+1) ;  /‘  influencing  node  ‘/ 

gi  =  ‘ (save  +  2 )  ; 
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hi  =  ‘(save  +3);  . 
gf  =  (float)gi/100-:  . 
hf  =  (float)hi/100.; 
gf  =  (float)gi/100,: 
x->g  =  gf; 
x->h  =  hf; 
x->next  =  NIL; 
j  =  4; 

while  (j  <  iccwnt) 

C  y  =  lalloc  0  ; 

•  y->influencer-=  * (save  +  j); 
gi  =  ♦(save  +  j  +  1)  ; 
hi  =  *(saVB  +  j  +  21; 
gf  =  /( float)  gi/ioo . ; 
hf  =  ( float) hi/IOO.; 
y->g  =  gf; 
y->h  =  hf; 
y->next  =  NIL: 
x->next  =  y; 

X  =  y; 
j  =  3  ♦  3; 


) 

) 


void  upaace_angles  (int  •theta,  int  *theta_post,  int  node,  List_i>tr  *head_infl_list) 


C  int  angle; 

int  a2,  infl,  th; 
double  thfl,  aZil,  anglefl; 

Listjptr  x; 
int  . 

j  =  node; 

th  =  ♦(theta  +  j); 

*(theta_post  +  j)  =  th; 
thfl  =  (float) thr 
X  =  * (head_inEl_list  +  j ) ; 
while  (X  !=  NIL) 

{  infl  =  x->influencer,- 

a2  =  *(theta_post  +  infl  -  1); 
a2fl  =  (floatla2;' 

anglefl  =  ((90.  -  a2fl)/90.}  ♦  ((90.  -  thfl)/90.  +  (thfl/90.)*x->g) 

+  (a2fl/90.)  ♦  ((90.  -  thfl)/90.  +  (thfl/90 . ) *x->h) ; 
angle  =  (int) (90.  -  90 . *anylef 1) ; 

X  =  x->next; 

.thfl,.=  90.  -  90.*anglefl:  -  ,  i.  . 

if  (angle  >  90)  angle  ='90; 
if  (angle  <0) wangle  =  0; 

*(theta_post  +  j)  =  angle; 

}  _  , 

This  code  implements  the  process  described  in  Section  IV  C,  Case  1 . 

For  some  of  the  problems  a  sigmoid  function  seemed  the  most  appropriate,  so  that  if  a  parent  is 
uncertain  or  nearly  so,  it  influences  the  child  minimally,  and  if  it  is  very  likely  to  be  true  it  strongly 
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affects  the  child  in  a  postive  (or  negative)  fashion,  and  if  it  is  very  likely  to  be  false  it  affects  the  child 
in  the  opposite  direction.  The  code  to  implement  this  process  is  the  followdng; 

void  func2  (List_ptr  *head_infl_list,  int  ‘save,  int  node,  int  icount,  int*  theta, 
int  *theta_post) 

{  int  tenp,  taii>2,  beta; 
double  scale,  betafloat; 

List_ptr  X,  y; 
int  j; 

/*  If  the  form  of  influence  is  2,  a  list  consisting  of  a  single  beta,  followed  hy  one  or  more*/ 

/*  pairs  of  integers.  Beta  is  entered  as  an  integer  and  interpreted  as  a  percent.  An  entry*/ 

/*  of  50  yields  a  factor  of  0.5.  The  first  of  each  pair  is  the  influencing  (parent)  node.  */ 

/*  Hie  second  of  each  pair  is  the  angle  of  influence,  alpha.  A  sigmoid  funtion  is  computed  */ 

/*  based  on  these  values.  The  entered  alpha  is  modified:  alpha  =  orig. alpha  *  scale.  */ 

/*  scale  =  2*(  (l./(l.  +  e**  (-beta*(45  -  theta_j)arent) ) )  -  0.5)  */ 

/*  Scale  is  thus  a  function  of  the  form  l./(l.+e**x) ,  adjusted  to  range  from  -1.  to  1.  */ 

X  =  lallocO; 

♦ (head_inf l_Jist  *  node)  =  x; 
beta  =  •  (save+1)  ,- 
betafloat  =  ( {double)beta) /lOO. ; 

x->influencer  =  *(save+2);  /*  influencing  node 

temp2  =  *(theta_post  +  x->influencer  -  1); 

tetip  =  *(save+3);  /*  alpha  before  scaling 

scale  =  !./(!.+  exp (-betafloat* (double) (45  -  tempi))); 
scale  =  2.*(scale  -  0.5); 
x->alpha  =  (int) ( scale* temp) ; 
x->next  =  NIL; 
j  =  4; 

while  (j  <  icount) 

(  y  =  lallocO; 

y->influencer  =  *(save  +  j); 
tenpl  =  *  (Checa_post  -i-  y->influencer  -  1)  ; 
teitp  =  *  (save  +  j  *  1) ; 

scale  =  l./(l.+  exp (-betafloat* (double) (45  -  tempi))); 
sceile  =  2.*(scale  -  0.5); 
y->alpha  =  (int) (scale*temp) ; 
y->next  =  NIL; 
x->next  =  y; 

X  =  y; 

j  =  j  +  2; 

) 


Because  many  versions  of  the  program  were  developed  and  one  version  would  usually  cannibalize 
another,  with  appropriate  "commenting  out",  a  complete  listing  is  not  ^ven.  For  those  willing  to 
hack  throu^  experimental  code  in  C,  an  operational  version  can  be  provided  as  a  text  file.  Send 
requests  to  ParksAD@nswc.navy.mil. 


*/ 

*/ 
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